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Protoctists and Microalgae: 

Antagonistic and Mutualistic Associations 
and the Symbiogenesis of Plastids 

Eberhard Schnepf 



1 Introduction 

Microscopical studies of associations of protoctists and microalgae have 
provided new insight into phytopathology and symbiogenesis of plastids 
also allowing a better understanding of fundamental problems of cell 
biology and cell evolution. Ecological problems and molecular biological 
findings are discussed marginally, lichens are not considered. 

The discovery of intracellular membrane systems has been of prime 
importance for cell biology and biochemistry. Endomembranes completely 
enclose various kinds of compartments in eukaryotic cells (Sitte 1998). 
Initially it was supposed that each compartment contained a special kind 
of cell plasma (Ruska 1960). Secretory processes in plant glands, based on 
membrane flow mechanisms (Schnepf 1961, 1969a) suggested a different 
interpretation of cell compartmentation. Electron microscopical investiga- 
tions in Geosiphon pyriforme (Schnepf 1964) and Glaucocystis (Schnepf et 
al. 1966) helped to establish a new concept which took into consideration 
also the two-membrane envelopes of chloroplasts and mitochondria: There 
are “plasmatic” compartments (cytoplasm s.str. and the nuclear matrix, the 
mitochondrial matrix, and the plastid stroma) and “nonplasmatic” com- 
partments (vacuoles, ER, Golgi cisternae and vesicles, microbodies as well 
as the spaces of mitochondrial and plastidal envelopes, of the mitochon- 
drial cristae and the thylakoid lumen). Biomembranes separate a plasmatic 
from a nonplasmatic compartment (Schnepf 1966). This compartmenta- 
tion rule has a few exceptions (Schnepf 1984, see also Cavalier-Smith 1993, 
2000, and his interpretation of the two-membrane envelopes) which will be 
discussed below, but a general meaning, as demonstrated, e.g., in the 
nomenclature of freeze-fractured membranes. 

The electron microscopy of Geosiphon and Glaucocystis initiated an 
understanding of the compartmentation of plastids, while also opening a 
door to understanding the phylogeny and evolution of chloroplasts and 
mitochondria. The idea of an endosymbiotic origin, first suggested by 
Schimper (1883) and Mereschkowsky (1905), largely disregarded at the 
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time, was more recently renewed (Schnepf 1966). The serial endosymbiosis 
theory is now widely accepted (Margulis 1981; Maier et al. 1996), supported 
by molecular biological data and by electron microscopical studies of 
existing associations between prokaryotic (cyanobacteria) or eukaryotic 
algae and protoctists. Genetic changes arising from endocytosymbiosis are 
by far greater than those arising from other intrinsic changes such as 
mutations or hybridization (Jeong 1983). 

Antagonistic and mutualistic associations are living models showing 
how chloroplasts can arise, and in fact evolve. The electron microscopy of 
antagonistic and mutualistic associations between microalgae and protoc- 
tists in respect to the interface between the two partners and its transfor- 
mation into chloroplast envelopes in conjunction with the various strate- 
gies of food uptake are the central issues of this article. 



2 Antagonistic Associations 

The terms predator/prey and parasite/host have been coined for antagonis- 
tic associations of multicellular animals and plants. A typical predator kills 
the prey, a typical parasite feeds on the host keeping it alive. It consumes 
the interest, while the predator consumes the capital. A clear distinction is 
difficult, even in multicellular organisms. A unicellular alga is usually, but 
not always, killed when attacked by a protoctist. Sommer (1994) therefore 
preferred the term “parasitoid” instead of “parasite”. Canter-Lund and 
Lund (1995) discriminated between grazers, organisms which digest the 
algae within themselves, and parasites, which digest the algae externally. 

Following largely Gaines and Elbrachter (1987) “parasite” is applied here 
for organisms which divide repeatedly after each feeding period whereas in 
“predators” feeding and cell division are not connected (see also Schnepf 
and Elbrachter 1992). A parasite is moreover frequently smaller than its 
host and its feeding process takes a long time (often in the range of hours 
or even days) whereas a predator is usually larger than the prey and it 
swallows the complete algal cell quickly and at once to digest it in a food 
vacuole. 

Parasitic eumycota and oomycetes generally digest the algae extracellu- 
larly and take up the nutrients by resorbing the molecules, a process which 
is not quite correctly named “osmotrophy”. Most algivorous protoctists 
engulf portions of the alga or whole cells by endocytosis and digest them in 
food vacuoles. Only those associations comprise models of chloroplast 
evolution. 
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2.1 Parasites Feeding by Resorption 

2.1.1 Endocytic Parasites 

Most endocytic “fungal” parasites of microalgae belong to oomycetes or 
chytrids (Sparrow 1960, Canter-Lund and Lund 1995). Lagenisma coscino- 
disci is perhaps the most extensively studied example among the oomy- 
cetes. It was detected by Drebes (1966) parasitizing the centric marine 
diatom Coscinodiscus. Lagenisma is diplanetic. A primary zoospore forms 
a primary cyst. It releases an isomorphic secondary zoospore which at- 
taches to a host cell and encysts there (Schnepf and Drebes 1977; Schnepf 
et al. 1978c,e). It germinates with a thin infection tube which penetrates the 
diatom frustule between the overlapping cingula and grows out into a thick, 
irregularly branched, non-septated, wall-less, multinucleate hypha (Drebes 
1966; Schnepf et al. 1978a). The host plasmalemma is pushed back, not 
pierced. The narrow space around the parasite thallus remains in contact 
with the periplasmic space of the diatom so that Lagenisma is endocytic but 
extrabiotic. The host plasmalemma is in close contact with the parasite 
plasmalemma. It disintegrates when the Coscinodiscus protoplast breaks 
down. 

Being an obligate, biotrophic parasite, Lagenisma then changes from the trophic into the 
reproductive phase.The thallus becomes covered by a cell wall and develops into a holocarp 
zoosporangium (Drebes 1966; Schnepf et al. 1978b, 1978e). The (primary) zoospores are 
released through a single discharge tube which opens apically (Drebes 1966; Schnepf et al. 
1978e). They seem to be driven out by the swelling of a mucilaginous material within the 
zoosporangium. The life cycle takes a few days. 

The sexual reproduction of Lagenisma differs considerably from that of other oomycetes 
(Schnepf and Drebes 1977; Schnepf et al. 1978b,d). In old, overpopulated cultures the last 
nuclear division is meiotic, not mitotic. The arising haploid swarmers resemble morpho- 
logically the diploid zoospores. Female-determined swarmers settle down near the host cell 
and encyst, forming an oogonium. Male-determined swarmers encyst close to an oogonium 
to form an antheridium. That drives a thin fertilization tube toward and into the oogonium. 
During plasmogamy the oogonium develops a thick, short hypha into which the fused 
cytoplasms with the two nuclei migrate. It is surrounded by a thick wall and becomes a 
resting spore (oospore) in which karyogamy takes place. The germination of a resting spore 
could not be observed as yet. 

An Olpidium sp., parasitizing the marine diatom Pseudonitzschia seems to 
be truly endobiotic (Elbrachter and Schnepf 1998), perhaps with a host- 
parasite interface of a single membrane, the parasite plasmalemma, as in 
Brassica cells infected with Olpidium brassicae (Lesemann and Fuchs 1970) 

Some endobiotic parasites belong to the dinoflagellates (Schnepf and 
Elbrachter 1992). Amoebophrya ceratii, a complex of host specific taxa 
(Coats and Park 2002), is an obligate parasite of dinoflagellates (Fritz and 
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Nass 1992; Maranda 2001). It has a very peculiar life cycle. A flagellate 
(dinospore) penetrates into a host cell and is then included in a periparasitic 
vacuole, situated within the nucleus or within the cytoplasm of the host. It 
becomes a round trophont and begins to rapidly grow. The girdle lengthens 
and the hypocone forms a cup-shaped circumvallation which finally en- 
closes the epicone. The inner surface of the parasite is covered by a typical 
amphiesma (plasmalemma plus a subsurface layer of flattened amphiesmal 
vesicles) and bears many flagella. The trophont becomes multinucleate. 
When at the end of the feeding phase the trophont largely fills the host cell, 
it everts into a vermiform stage, thereby enclosing portions of host cyto- 
plasm. The host-parasite interface consists hitherto of generally three mem- 
branes, the homology of which is unclear (Fritz and Nass 1992). During the 
main growth phase food vacuoles are absent. The nutrients are obviously 
taken up by resorption. The vermiform leaves the host cell and divides into 
hundreds to thousand new dinospores. 



2.1.2 Epicytic Parasites 

Many epicytic parasites (often named “epibiotic”) of microalgae feed by 
resorption. Most of them belong to the chytrids. A Chytridium sp., parasit- 
ising the chlorococcalean alga Scenedesmus armatus has been investigated 
by Schnepf et al. (1971b). A Chytridium zoospore attaches to a Scenedesmus 
cell, retracts the flagellum, encysts and drives a haustorium through the wall 
of the alga. The host deposits electron dense material around the perfora- 
tion site. Occasionally a “callosity” blocks the invasion. It consists of irregu- 
lar host wall material and includes remnants of host cytoplasm. It inhibits 
the further development of the parasite that consequently dies. 

A successful haustorium pushes the host plasmalemma back but does 
not pierce it. It is surrounded by the fungal wall and does not contain 
mitochondria. Accompanied by a small layer of host cytoplasm it may even 
penetrate through a chloroplast, the envelope of which is likewise invagi- 
nated but not pierced. 

The main body of the chytrid remains epicytic. It grows while the host 
cytoplasm degenerates. At the end of the feeding period a plug separates 
the then multinuclear cytoplasm of the developing epicytic zoosporangium 
from the haustorium. Subsequently the sporangial wall is thickened. A 
second, inner wall layer is deposited, and the zoospores are formed. 

A marine chytrid on the chain-forming diatom Bellerochea malleus 
resembles the Scenedesmus parasite in essential details (Schweikert 1997). 
Deposition of host material around the invading haustorium has not been 
observed here. The host protoplast does not retract locally, in contrast to 
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the freshwater diatom Asterionella formosa when attacked by the chytrid 
Rhizophydium planktonicum where the fungus becomes deprived of food 
and dies (Canter and Jaworsky 1979). 



2.2 Phagocytotic Predators 

Most predatory algivorous protoctists phagocytose complete algal cells by 
means of pseudopodia or ingest them through a cytostome. 

Amoeboid, plasmodial and heliozoan organisms surround a prey cell 
with a pseudopodium and enclose it in a food vacuole. This process depends 
on membrane flow mechanisms (Hausmann and Radek 1993). When the 
heliozoan Actinophrys sol catches the green flagellate Chlorogonium elon- 
gatum , the flagellate is attached to the surface of axopodia by means of the 
content of extrusomes (Sakaguchi et al. 1998). By the fusion of the ex- 
trusomes, the membranes are now transformed into the membranes of the 
food vacuoles (Hausmann and Patterson 1982). 

In the vampirellid amoebae, membrane stores seem to pre-exist which can be used rapidly 
for the formation of food vacuoles. The vampirellids open the cell wall of filamentous 
( Oedogonium ) and unicellular (Closterium) green algae obviously by local enzymatic activ- 
ity. Due to the turgor of the algal cells, part of the prey cytoplasm is explosively released 
when the cell is opened. The material is enclosed within a few seconds in the arising food 
vacuole which has a membrane area of more than 4000 pm 2 . Further portions of the algal 
cytoplasm are ingested by a pseudopodium which invades the cell while the main body of 
the predator is epicellular (Hausmann and Radek 1993). 

Various heterotrophic and mixotrophic heterokont flagellates and hapto- 
phytes likewise use pseudopodia to ingest algal cells. Surveys are given in 
the book of Patterson and Larsen (1991). They are supplemented here by a 
few further examples. The heterokont marine alga Reticulosphaera socialis 
consists of spherical cells which are connected by a common reticulopo- 
dium which is used to capture and digest diatoms. They can survive only 
for a restricted time period by photosynthesis alone (Grell 1989). Parapedi- 
nella reticulata is a heterokont, apoplastidic flagellate with long, thin 
filopodia. They adhere to small diatoms and are then retracted so that the 
prey can be enveloped and digested (Schnepf 1999). 

More sophisticated with very complex structures are the cytopharyngeal 
apparatuses in ciliates and euglenoids. Only a few examples can be pre- 
sented here. The ciliate Pseudomicrothorax dubius preys specifically on 
filamentous cyanobacteria. The algae are ingested through a cyto- 
pharyngeal basket consisting of hundreds of microtubules forming nema- 
todesmata and nematodesmal lamellae (Hausmann and Peck 1978). The 
algal filaments are ingested at rates of up to 15 pm per second. The force for 
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phagocytosis seems to be generated by actin filaments whereas the micro- 
tubules serve as skeletal elements. During food uptake the membrane of the 
food vacuole increases rapidly. Vacuole growth results from the fusion of 
vesicles containing hydrolytic enzymes which digest the wall of the cyano- 
bacterium in a matter of seconds (Hausmann and Peck 1979). Other ciliates 
ingesting cyanobacteria are less specific. They often require several hours 
to accomplish digestion of a single filament (Hausmann and Hiilsmann 

1996) . 

The ciliate, Climacostomum virens, uses oral membranelles to drive 
suspended food particles into its buccal cavity and ingests them periodically 
at the end of this tube. Up to 140 Chlorogonium elongatum cells can be taken 
up in 2-5 min (Fischer-Defoy and Hausmann 1981). 

Only a few euglenids are predators of unicellular algae though many 
species are heterotrophs. Their feeding apparatuses have been reviewed by 
Triemer and Farmer (1991). Peranema trichophorum employs two different 
modes of feeding. It ingests a wide array of particles, also living cells of the 
green euglenid Lepocinclis buetschlii or it uses the feeding apparatus to take 
up the prey protoplast in a myzocytosis-like process (Sect. 2.4) (Triemer 

1997) . 

Dinoflagellates have developed diverse nutritional strategies (Gaines 
and Elbrachter 1987; Schnepf and Elbrachter 1992). About half of the extant 
species are heterotrophs and many photosynthetic species are mixotrophic. 

Noctiluca scintillans (synon. N. miliaris) is an omnivorous dinoflagellate (Elbrachter and Qi 

1998) which ingests even large pieces of glass but also small algae. The prey is trapped at the 
tip of a long, mobile tentacle by means of mucilage. The tentacle then places the prey into 
the cytostome where a food vacuole, bounded by a single membrane, is pinched off (Nawata 
and Sibaoka 1983). Membrane hyperpolarization and transmembrane ion currents through 
the cytostome region precede the feeding process (Nawata and Sibaoka 1987). 

The athecate dinoflagellate, Oxyrrhis marina , feeds on a wide range of algae and is even 
cannibalistic (Dodge and Crawford 1974). The phagocytosis of a cell of Dunaliella bioculata 
takes not more than 10-15 s. A well-elaborated cytostome is lacking. During the ingestion 
the ventral peripheral microtubules are reversibly rearranged (Hohfeld and Melkonian 
1998). 

Even thecate dinoflagellates are able to take up other algae by the whole. The mixotrophic 
marine Fragilidium subglobossum feeds exclusively on Ceratium spp. (Skovgaard 1996). The 
sulcus plates move away from one another creating a longitudinal feeding gap along the 
sulcus plates. The theca of the Ceratium cell is dissolved early, in part even when the prey is 
not yet ingested completely. As the ingestion proceeds and the body of Fragelidium in- 
creases, the thecal plates become widely separated and the shape of the cell alters drastically. 
A Ceratium tripos cell is engulfed within 15 min, the smaller C. lineatum within 5 min. 
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2.3 Pallium Feeding 

Several dinoflagellates have overcome the problem to feed on cells much 
larger than themselves, by pallium feeding, first observed in Protoperid- 
inium by Gaines and Taylor (1984). Prey species belong to the dinoflagel- 
lates, chlorophycean, cryptophycean and prymnesiophycean flagellates 
and diatoms (Naustvoll 2000a). 

Jacobson and Anderson (1986) describe the feeding process as follows. 
The dinoflagellate swims about with a cytoplasmic peduncle, extruded near 
the flagellar grooves. If it encounters a prey it changes its swimming 
behaviour to move in tight circles. Eventually it makes contact with the tip 
of the peduncle. A pseudopodium then extends from the flagellar grooves 
along the peduncle which shortens pulling the prey closer to the predator. 
The pseudopodium enlarges and encloses the prey, usually single cells or 
chains of diatoms, as a feeding veil (pallium). Even large chains of cells with 
long spines are captured, e.g. a 600-pm-long chain of the diatom Chaeto- 
ceros within 15 min by the pallium of Protoperidinium spinulosum , the 
diameter of which was 50 pm. Eventually the prey is enclosed in a large food 
vacuole outside the theca and digested there, generally within less than 
30 min. Only digested material is transported into the cell body. At the end 
of the feeding phase the pallium is retracted and undigested material is 
released. 

The pallium is a vesiculate, membraneous sac containing several micro- 
tubular ribbons but neither mitochondria nor ER. The microtubules origi- 
nate from an internal microtubular basket and pass through a sphincter- 
like ring located inside the posterior flagellar groove (Jacobson and Ander- 
son 1992). The organization of the pallium strongly suggests that it is 
homologous with the feeding tube of other dinoflagellates (Sect. 2.4) albeit 
morphologically and physiologically very different. 



2.4 Peduncle Feeding 

Myzocytosis (Schnepf and Deichgraber 1984) is a special form of endocy- 
tosis observed in various epicytic parasitic dinoflagellates (reviewed in 
Schnepf and Elbrachter 1992). The prey protoplast is pierced by a cytoplas- 
mic feeding tube, the peduncle, which is homologous with the pallium (Sect. 
2.3) or by a phagopod. Peduncle and phagopod have not always been clearly 
distinguished in previous publications (Drebes and Schnepf 1998). The host 
cytoplasm is sucked up gradually and included in a food vacuole. The host 
plasmalemma is not taken up so that the ingested cytoplasm is separated 
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from the parasite cytoplasm by a single membrane, the membrane of the 
food vacuole. 

Peduncle feeding and the structure of the peduncle were first described 
in detail for Gymnodinium fungiforme (synon. Katodinium f.) (Spero and 
Moree 1981) and for Paulsenella (Drebes and Schnepf 1982; Schnepf et al. 
1985). 

The most extensively studied species is Paulsenella vonstoschii (in for- 
mer publications as P.cf. chaetoceratis or P. sp.), feeding on Streptotheca 
thamesis (newer name: Helicotheca t.). The P. vonstoschii dinospores (= 
flagellates) are chemotactically attracted by the host, presumably by short- 
lived components of its mucilage. These components are not released by 
egg cells, sperm cells and auxozygotes. The appetence of Paulsenella is light 
dependent (Schnepf and Drebes 1986). The zoospore attaches to the girdle 
region and drives a cytoplasmic peduncle through the overlap of the cingula 
into the host cell. The peduncle emerges near the flagellar grooves through 
a sphincter of striated filaments. Its basis is surrounded by a cellulosic 
sheath, the distal part is mobile. It pierces the host plasmalemma the rim 
of which becomes closely attached to the apical opening of the peduncle. It 
retains its integrity elsewhere though it shrinks rapidly during the uptake 
process. The host vacuoles contract and the protoplast is sucked out within 
less than one hour (Drebes and Schnepf 1982; Schnepf et al. 1985). When 
an extremely large Streptotheca cell is attacked, the parasite is unable to 
ingest the whole cell contents. The cell recovers if the host nucleus is left 
intact - a true case of parasitism. 

The peduncle is formed by the emergence of a preformed microtubular 
basket consisting of bands of microtubules. It is a thin cytoplasmic tube 
bounded by a single membrane, the plasmalemma, which continues into 
the inner tube membrane and the membrane of the developing food vacu- 
ole. It is stiffened by the bands of microtubules. The elongation of the 
peduncle and its movement within the host cell seems to be driven by the 
sliding of the bands and of the microtubules within the bands. Before its 
protrusion the microtubular basket is characteristically associated with 
long electron-transparent and long electron-dense vesicles, as in the micro- 
tubular basket that forms a pallium (Sect. 2.4). 

Paulsenella chaetoceratis invades the interior of Chaetoceros cells 
through a yet non-silicified tip of the setae. The feeding tube may here reach 
a length of more than 100 pm (the dinospores are 12-15 pm long). Chaeto- 
ceros chloroplasts pass the narrow feeding tube of P. chaetoceratis , within 
a few seconds being deformed drastically from a lense-shaped organelle 
3.5 pm in diameter and 1.8 pm thick to a cylinder 8 pm long and 1.8 pm 
thick. Within the food vacuole the chloroplasts regain their previous shape 
and internal structure. Their envelope is not ruptured. Even the periplasti- 
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dal ER (“chloroplast ER”, Sect. 6.2) is partially preserved (Schnepf et al. 
1988). At the end of the feeding process the peduncle shortens, collapsing 
apically, and is retracted within a few seconds. Digestion does not begin 
before food uptake has been concluded completely. The ingested cytoplasm 
and the chloroplasts therein appear to be structurally intact (perhaps also 
active) for more than an hour. The trophont encysts during digestion and 
divides repeatedly to eventually give rise to dinospores. The life cycle is 
concluded within about a day. Sexual processes have been observed but the 
knowledge on the details is still incomplete (Drebes and Schnepf 1988). 

Myzocytosis by means of a typical peduncle was observed also in a Cypthecodinium cohnii- 
like dinoflagellate preying on the small unicellular red alga Porphyridium sp. (Ucko et al. 
1997). The cell is sucked out within 10-30 s, leaving an empty, soon disintegrating cell wall. 
The peduncle is retracted after each uptake process. Actin seems to be involved in the suction 
and the retraction of the peduncle. Up to 20 Porphyridium cells are taken up until the 
dinoflagellate encysts and divides. 

A 66-kDa glycoprotein in the Porphyridium cell wall is the recognition site for the 
dinoflagellate which contains enzymes that degrade the cell wall complex of this alga but 
not that of other rhodophytes. The dinoflagellate is prevented to prey when the algal cells 
are blocked with antiserum specific to the 66-kDa glycoprotein or the lectin con A (Ucko et 
al. 1999). 

Peridiniopsis berolinensis feeds on cryptophytes but also on nematodes and rotifers, with 
a typical peduncle which is also used to take up particulate food and whole algal cells and 
is, therefore, not a myzocytotic apparatus in the narrow sense (Calado and Moestrup 1997). 



2.5 Phagopod Feeding 

Phagopod feeding was first described in detail for the dinoflagellate Am - 
phidinium cryophilum (Wilcox and Wedemayer 1991). The phagopod is a 
hollow cylinder that extends from the antapex. In A. cryophilum it consists 
of electron-opaque material that is possibly deposited on a membrane but 
is no longer cytoplasmic when in function. In contrast to a peduncle it does 
not pass through a sphincter. The prey cells, other dinoflagellates, are 
sucked out within 10 min or more. The food is deposited in a nascent 
vacuole. After feeding the phagopod is left in the prey cell. The peduncle of 
A. cryophilum is not involved in food uptake. 

The phagopod of A. cryophilum has a wide apical opening, in contrast to 
that of Gyrodinium undulans , a dinoflagellate that feeds on the marine 
diatom Odontella aurita. No other diatoms are accepted but copepod and 
rotifer eggs are also sucked out (Drebes and Schnepf 1998). A G. undulans 
flagellate may attach to any site of the diatom frustule. It sticks with its 
antapex and protrudes a short phagopodium which broadens to a flat 
appressorium exactly over an areolar chamber. The frustule is pierced 
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through the areole which has a diameter of only 0.1-0.25 pm. Within the 
diatom the phagopod is a 2-pm-wide and up to 60-pm-long cellulosic tube. 

The Odontella protoplast contracts but is eventually pierced by the tip 
of the phagopod which then opens to form a narrow funnel. The cytoplas- 
mic fluid and perhaps small organelles are sucked up. The chloroplasts are 
too large to enter the funnel. They thus cannot plug the bottle neck where 
the phagopod passes through the areola. The feeding phase takes 1.5-3 h 
and changes from myzocytosis to phagocytosis when the host plas- 
malemma is broken down. The phagopodium and an aggregation of discol- 
oured chloroplasts are left behind when the ingestion is finished and the 
dinoflagellate detaches and divides. 

Myzocytosis-like processes do not seem to be restricted to dinoflagellates. The kinetoplastid 
flagellate Bodo bacillar iophagus parasitizes on the diatom Navicula extending a delicate 
“haustorium” through the raphe into the cell. It restricts itself to sucking up the nucleoplasm 
(Bursa 1963). The euglenoid Peranema trichophorum feeds phagotrophically and myzocy- 
totically (Triemer 1997). Some ciliates belonging to the Colpodea have a feeding apparatus 
that structurally and functionally resembles a dinoflagellate peduncle (Foissner and Didier 
1984) as do the suctorian tentacles (Bardele 1972). 



2.6 Epicytic Parasites Feeding with Pseudopodia 

Diatoms are well protected by their frustule against the attack of small 
parasites or predators. Peduncle feeding, phagopod feeding and pallium 
feeding are ways to overcome this barrier, special pseudopodia may also 
serve towards these means. 

An epicytic amoeba with an endocytic, phagocytic pseudopodium is Rhizamoeba schnepfii 
(Kiihn 1996/97). It is obligately algivorous and feeds on various marine diatoms. The 
pseudopodium penetrates into the frustule between the girdle bands and gradually phago- 
cytizes portions of the host protoplast. The food vacuoles are transported into the main body 
which remains outside the frustule. Within a few minutes up to a few hours most of the 
diatom protoplast is consumed. Only life diatoms are fed upon. Intact frustules of Coscino- 
discusgranii and Odontella sinensis cannot be penetrated but phagocytosis on damaged cells 
is possible. 

The apoplastidic nanoflagellate Pirsonia guinardiae is an epicytic parasite 
of the marine diatom Guinardiaflaccida. It feeds in a unique mode (Schnepf 
et al. 1990). Further recently detected Pirsonia species are distinguished by 
their morphology, their development and the host range (Kiihn et al. 1996; 
Schweikert and Schnepf 1997b). Electron microscopy demonstrates that 
Pirsonia belongs to the heterokonts (stramenopiles) (Schnepf and Schweik- 
ert 1996/97). 
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The mobile stages are small (generally about 5x10 pm ), naked flagel- 
lates. They attach to a host cell with the posterior cell pole, guided by 
chemotaxis as well as by the topography of the frustule surface to host-para- 
site specific sites (Schnepf and Schweikert 1996/97; Kuhn 1997). The attach- 
ment depends on an intact actomyosin system. An antapical pseudopo- 
dium penetrates into the frustule. The main body remains epicytic and 
becomes the “auxosome”. The flagella wind around the cell apex and are 
retracted in most species. The auxosome does not encyst. Inside the diatom 
the pseudopodium develops into a “trophosome”, consisting of a single, 
enlarging proximal digestion vacuole and distal pseudopodia which phago- 
cytize portions of host cytoplasm and transport them into the digestion 
vacuole. The thin connection between trophosome and auxosome is not 
passed by particulate host material. The diatom reacts by shrinking of the 
vacuoles and/or systrophical movement of cytoplasm and chloroplasts 
toward the infection site. The feeding phase takes many hours. 

Electron microscopy revealed that the host plasmalemma remains un- 
interrupted during the feeding process. Freshly phagocytized portions of 
host cytoplasm are still bounded by their plasmalemma when included in 
a food vacuole. The trophosome cytoplasm does not contain larger organ- 
elles such as mitochondria, only some fibrillar structures and a few vesicles 
(Schnepf and Schweikert 1996/97). 

Under culture conditions an already attacked but still living cell is 
usually more attractive than an uninjured one. It seems to release more 
chemotactically effective, specific substances than healthy cells. The tro- 
phosomes of adjacent auxosomes may fuse to form a common digestion 
vacuole (Kuhn et al. 1996). Adjacent sister cells are obviously recognized as 
“self”. 

The auxosome begins to divide already during phagocytosis and digstion. The arising 
daughter cells continue to divide until the direct or indirect contact with the trophosome is 
lost. They then become flagellate mother cells which divide again one or two times to give 
rise to new flagellates. Up to about 60 offspring may result from a single infection (Schweikert 
and Schnepf 1997b). 

A Guinardia flaccida cell may survive the attack of a single P. guinardiae because the 
nucleus is often not ingested. In plankton samples only about one third of the infested cells 
were found to be killed (Schnepf et al. 1990). 

The genus Cryothecomonas comprises nanoflagellates with two smooth, 
heterodynamic, apically inserting flagella of unequal length. The cell is 
surrounded by a delicate, close-fitting theca. The theca is multilayered and 
contains a fribrillar, presumably non-cellulosic polysaccharide. Some spe- 
cies have extrusomes (Schnepf and Gold 2000; Schnepf and Kuhn 2000). 
Analyses of the small subunit rRNA indicate that Cryothecomonas belongs 
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to the Cercozoa and is related with the sarcomonad flagellate Heteromita 
globosa (Kuhn et al. 2000). Cryothecomonas aestivalis (Drebes et al. 1996) 
(Sect. 2.7) and C. longipes (Schnepf and Kuhn 2000) are parasites of marine 
diatoms. 

An attacking C. longipes protrudes a broad pseudopodium through a 
ventral slit of its theca. It penetrates the frustule between overlapping 
cingula, pinches off portions of the diatom protoplast and transports the 
food vacuoles into the epicytic trophont. The pseudopodium may branch 
and reach a length of more than 30 pm when the host protoplast shrinks 
and retracts. It contains a largely ribosome-free mass of fine fibrils, some 
vesicles and a few microtubules in its proximal part. When it emerges 
special flat vesicles are incorporated into the plasmalemma at the basis of 
the pseudopodium to enlarge it (Schnepf and Kiihn 2000). The food uptake 
can take more than 3 h and is finished when the diatom protoplast begins 
to disintegrate. The trophonts divide after feeding. 



2.7 Endocytic Phagotrophic Parasites 

Cryothecomonas aestivalis belongs to a group of parasitic protoctists which 
completely invade the host cell gradually phagocytosing the host cytoplasm 
but remaining extrabio tic. The organism was detected in 1993 feeding on 
the marine diatom Guinardia delicatula (Drebes et al. 1996). When a 
flagellate has traced a G. delicatula chain, it slides along its surface and scans 
it with the tip of the anterior flagellum. Eventually it attaches with its basal 
pole, generally close to the overlap between valva and cingulum. Becoming 
slightly amoeboid it squeezes through the frustule, inserting the posterior 
first. Inside the diatom the flagella shorten. The invasion takes 5-10 min. 

The trophic phase begins with the emergence of a pseudopodium from 
the basal part of the flagellate. The theca has a gap here. The pseudopodium 
gradually phagocytizes portions of the diatom protoplast, contracting 
rhythmically. At the contact zone with the diatom protoplast, the pseudo- 
podium has a fibrillar-granular appearance. Many vesicles and invagina- 
tions of the plasmalemma indicate a high rate of membrane flow. Mitochon- 
dria are not found here. The trophont grows considerably. It fills a large 
part of the host cell when the protoplast has been completely ingested. The 
food uptake comes to an end when the Guinardia plasmalemma is rup- 
tured. The feeding phase takes about 4 h and is followed by multiple binary 
divisions. Usually 8 (but up to 32) flagellates arise. 

Examples for endocytotic phagotrophic flagellates that feed with a well-elaborated cy- 
tostome are the euglenoid Rhynchopus coscinodiscivorus and the bodonoid (kinetoplastid) 
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Hemistasia phaeocysticola. R. coscinodiscivorus feeds on the large, marine diatom Coscino- 
discus concinnus (Schnepf 1994a). It has a subapical flagellar pocket with two concealed, 
rudimentary flagella and a subapical cytostome. Cell apex, cytostome, cytopharynx and 
flagellar pocket are reinforced by microtubules and a band of rods. The mitochondria have 
conspicuously poorly developed cristae. 

The flagellate tears off pieces of the retracted host protoplast with its cytostome by jerking 
contractions of the cell apex. They are transported through the cytopharynx and added to 
one of the digestion vacuoles. Cell debris is eventually also ingested. R. coscinodiscivorous 
divides as long as food is available. A single host cell can provide food for up to 100 offspring. 
It takes several days until a Coscinodiscus cell is consumed. 

Sometimes confused with R. coscinodiscivorus (both are also incorrectly named u Pronoc- 
tiluca”) was Hemistasia phaeocysticola. The latter has a typical mitochondrion of the 
polykinetoplastic type (Elbrachter et al. 1996). The two flagella insert in a deep, subapical 
pocket. The apical rostrum, flagellar pocket, cytostome and cytopharynx are reinforced by 
various cytoskeletal elements. Food is ingested phagocytotically mainly from weakened or 
decaying organisms, preferentially diatoms, and accumulated in a single, large digestion 
vacuole. Feeding and division are not separated. 

Amoeboaphelidium and Aphelidium are endocytic, extrabiotic parasites of 
chlorococcalean algae. Diverse species of Amoeboaphelidium have been 
studied with respect to their host range, their mode of food uptake, their 
development and their fine structure (Gromov and Mamkaeva 1968, 1970). 
A similar parasite is Aphelidium (Schnepf et al. 1971a; Schnepf 1972). 
Amoeboaphelidium propagates itself by amoebae, Aphelidium by zoo- 
spores with a single opisthokont flagellum. Their taxonomic position is 
unclear. 

Aphelidium cf. chlorococcarum is parasitic on Scenedesmus. A zoospore 
attaches to a host cell, encysts, and penetrates the cell wall. The alga reacts 
with the formation of a tube-like channel of cell wall material around the 
invasion site. The naked Aphelidium invades into the algal protoplast the 
plasmalemma of which is not pierced but structurally modified at the 
host/parasite interface. The periparasitic space remains in connection with 
the infection channel. 

The food vacuoles are initially bounded by two membranes closely 
sticking together. The outer one belongs to the parasite, the inner one to 
the alga. The latter eventually disintegrates. Closely sticking parasite and 
host membranes (“compound membranes”) are found also elsewhere at the 
host/parasite interface, especially in the region around the infection chan- 
nel (Schnepf 1972). The amoeba develops into a multinucleate plasmodium 
which eventually cleaves into 10-20 zoospores. A sporangial wall is not 
formed. Under certain conditions a thick-walled resting spore arises in- 
stead. 

Another, somewhat similar organism is Pseudaphelidium drebesii , a rare 
parasite of the marine diatom Thalassiosira punctigera (Schweikert and 
Schnepf 1996, 1997a). Its development differs from that of Aphelidium . 
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Pseudaphelidium has a unique form of mitosis. The nuclear envelope 
remains closed and the spindle microtubules are extranuclear. In 
Aphelidium and Amoeboaphelidium the details of mitosis are unknown as 
yet. 

The motile, infective stages of P. drebesii are small zoospores with a 
single, opisthokont flagellum. They attach to a host cell, retract the flagel- 
lum and encyst. The cyst contains a long, invaginated infection tube. It is 
internally bounded by a continuation of the plasmalemma and opens over 
a gap in the cyst wall, positioned at overlaps in the diatom frustule. The 
infection tube everts, penetrates through the frustule and creates an open- 
ing through which the P. drebesii protoplast enters the cell lumen. The 
Thalassiosira plasmalemma is pushed back. The parasite is endocytic but 
extrabiotic. Portions of host cytoplasm are included in food vacuoles which 
fuse to form a large, central digestion vacuole, surrounded by the then 
multinucleate parasite plasmodium. Host and parasite are separated by 
their plasma membranes. The phagocytotic areas of the plasmodium con- 
tain bundles of microfilaments. 

At the end of the trophic phase the plasmodium is a hollow sphere and 
fills the frustule completely. It cleaves to form amoeboid cells which soon 
encyst. The cyst releases four, sometimes less zoospores. 

Phagomyxa algarum was found by Karling (1944) as an endoparasite of 
filamentous brown algae. He placed it in the Plasmodiophorida though it 
feeds phagocytotically, in contrast to the other members of this taxon which 
take up their nutrients by resorption. A similar organism, Phagomyxa 
bellerocheae (Schnepf et al. 2000), is a frequent endoparasite of the marine 
diatom Bellerochea malleus (Schnepf 1994b). A related species is P. odon- 
tellae , feeding on the marine diatom Odontella sinensis (Schnepf et al. 
2000 ). 

Electron microscopical investigations of the zoospores and their flagellar apparatus con- 
firmed that Phagomyxa is related with the other Plasmodiophorida (Schnepf 1994b). Analy- 
ses of the SSUrRNA revealed that Phagomyxa and Plasmodiophorida form a monophyletic 
clade. It clusters most closely with a rhizopod assemblage consisting of sarcomonads and 
chlorarachniophytes (Bulman et al. 2001). The Phagomyxa sequences appeared distant 
enough from those of the Plasmodiophorida to justify a separate order, the Phagomyxidae. 

The first infection stages of Phagomyxa are not yet known because the two 
species could not be cultivated. P. bellerocheae forms a plasmodium, a 
hollow sphere surrounding a large digestion vacuole, within the host pro- 
toplast. The structure of the host/parasite interface is not yet known. 

The plasmodia of P. bellerocheae cleave when the host material is con- 
sumed. A single sporangiosorus is formed, consisting of numerous zoospo- 
rangia. The zoosporangia are surrounded by a thin cell wall and release tiny 
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zoospores (2.5x4 pm). P. odontellae plasmodia form peripheral, three-di- 
mensional networks within the host cell, with numerous small food vacu- 
oles and develop several sporangiosori (Schnepf et al. 2000). 



2.8 General Aspects 

In spite of the great diversity of predators and parasites some general 
conclusions are possible. Parasites are usually host specific, rather than 
predators, especially those predators which are much larger than their prey. 
Noctiluca , a big cell, is an example for omnivory; ciliates acquiring their 
food by filter feeding are another example. In those cases (Sherr et al. 1991) 
as also, e.g., in some naked dinoflagellates (Naustvoll 2000b) the size of the 
particles is the determinative factor. There are, however, also highly selec- 
tive predators. The ciliate Pseudomicrothorax dubius can ingest its prey, 
filamentous cyanobacteria, only by immediate digestion of the cell walls by 
specific enzymes (Hausmann and Peck 1979). An omnivorous feeder has a 
better chance of acquiring food than a specialist but it has to be equipped 
with a broader array of digestive enzymes. Most predators therefore possess 
sensory prey-detecting mechanisms (Sibbald et al. 1988). 

Pallium feeders have a restricted choice of prey organisms (Buskey et al. 
1994), the food preference being influenced by the size of the prey 
(Naustvoll 2000a). 

Small parasites need to be clever. They have to find access to their hosts. 
The host range may indicate taxonomic relationships (Cook 1963) and can 
be extremely narrow. There are chytrids which discriminate between mor- 
phologically similar strains (“demes”) of what has traditionally been con- 
sidered single diatom species. Even individual clones exhibit different 
degrees of susceptibility (Mann 1999). - An interesting curiosity is Gy- 
rodinium undulans . It feeds on a single diatom species, Odontella aurita , 
but also on eggs of rotifers and copepods (Drebes and Schnepf 1998). 

The infectivity of certain parasites can depend on light (Schnepf and 
Drebes 1986; Bruning 1991). The incidence of preying and of infections may 
be influenced by the degree of turbulence (Kuhn and Hofmann 1999) which 
favours random collisions on the one side but destroys gradients of chemi- 
cal signals around the prey or host on the other side. 

Chemotaxis is the most important mechanism for long-distance attrac- 
tion (Spero 1985). Chemosensory stimulation by algal exudates can change 
the swimming pattern of flagellates and zoospores (Schnepf and Drebes 
1986; Kuhn 1997). Already attacked cells are especially attractive, indicating 
that they release more attractants than uninjured ones (Kuhn 1998). 
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In contact recognition, nanoflagellates frequently use the tip of the 
anterior flagellum (Drebes et al. 1996; Kiihn et al. 1996). Cannibalism is 
generally avoided but possible. The chemical as well as the morphological 
topography of the host cell surface guides the parasite to find a site for 
invading (Schnepf and Schweikert 1996/1997; Kiihn 1997). Surface charges 
also seem to play a role in food recognition (Hammer et al. 1999). 

Gap-free host cell walls can be digested (Ucko et al. 1999); sometimes 
even locally (Hausmann and Radek 1993). Diatom frustules are usually 
invaded through overlaps of the shell or through openings such as rimopor- 
tulae. If a parasite fails to attach and to encyst at a proper site it will perish 
(Schweikert and Schnepf 1997a). 

An invading obligatory biotrophic parasite does not penetrate through 
the host plasmalemma but pushes it back. The feeding phase comes to an 
end when the host plasmalemma disintegrates. The interface between a host 
and an extrabiotic parasite thus consists always of the two plasma mem- 
branes. Haustoria are additionally surrounded by the fungal cell wall which 
is not present in intracellular, extrabiotic oomycetes like Lagenisma until 
the thallus becomes a zoosporangium. 

Parasites which feed myzocytotically pierce the host plasmalemma the 
rim of which becomes closely attached to the opening of the feeding tube 
or phagopod. When the host cell is sucked out, the surface of the protoplast 
decreases rapidly. It is still unexplained how the plasmalemma is reduced, 
for it is not included in the food vacuole, where host and parasite cytoplasm 
are separated only by the membrane of the food vacuole. 

Curiously the pallium, the dinoflagellate peduncle, the Pirsonia tro- 
phosome, and the feeding parts of Cryothecomonas pseudopodia do not 
contain mitochondria though the uptake processes surely require ATP and 
are, at least in part, driven by the actomyosin system. 

The host/parasite interface of some endobiotic parasites consists of a 
single membrane, the parasite plasmalemma. The host membrane is lost in 
these cases. 

Eumycotes and oomycetes digest their food extracellularly. It is un- 
known how the digestive enzymes of biotrophic parasites pass the two 
membranes at the host/parasite interface and how the nutrients are resor- 
bed. Digestion vacuoles outside the cell body proper are formed by Pirsonia 
and by pallium feeding dinoflagellates. 

Only a few defense mechanisms are known as yet (Verity and Smetacek 
1996; Wolfe 2000). Ingestion by filter feeders like ciliates is impeded by 
spikes ( Chrysochromulina ), the formation of coenobia ( Scenedesmus ) or 
colonies ( Phaeocystis ) (Sournia 1982; P.J. Hansen et al. 1995; Heckmann 
1995). These defense mechanisms are ineffective against infections by 
parasites. The frustule of a diatom protects against the attacks of invasive 
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parasites (Kuhn 1996/97) as does the skin of the Phaeocystis colony 
(Smetacek 1999). In the diatom Thalassiosira rotula a wound-activated 
chemical defense has been detected (Pohnert 2002; further examples 
see:Verity and Smetacek 1996). Effective against the propagation of biotro- 
phic parasites are hypersensitive reactions (Canter and Jaworsky 1979). 



2.9 Ecological Impact and Epidemiology 

Protoctists feeding on plankton algae play an important role in the biotic 
control of primary production and of carbon flux through marine and 
freshwater food webs, a role that was underestimated for a long time. It was 
commonly believed that “big ingest small” (e.g., copepods/euphasids feed- 
ing on diatoms) (Smetacek 1999). The detection of many algivorous pro- 
toctists has changed this inaccurate perception, the more because it is now 
recognized that many photosynthetic protoctists also feed on plankton 
algae (Stoecker 1998). Bockstahler and Coats (1993a) found up to 30% of 
the pigmented dinoflagellates with food vacuoles containing ciliates, dia- 
toms and dinoflagellates. The mixotrophic Gymnodinium sanguineum was 
estimated to remove ciliate biomass by up to 67% of the standing crop, daily 
(Bockstahler and Coats 1993b). 



A few selected data may illustrate the ecological impact of algivorous protoctists. Heterotro- 
phic dinoflagellates feeding by peduncle or pallium were most abundant in the Kattegat 
during periods characterized by large phytoplankton blooms (diatoms and dinoflagellates). 
During these periods the heterotrophic biomass corresponded to between 13-37% of the 
phytoplankton biomass (P.J. Hansen 1991). In the Oslo^ord the Protoperidinium biomass 
may reach nearly 40% of the algal carbon biomass in the form of diatoms and phototrophic 
dinoflagellates (Kjaeret et al. 2000). Heterotrophic protoctists grazed off 20-100% of the 
daily primary production in antarctic coastal waters (Archer et al. 1996). 

Paulsenella on Streptotheca reached infection rates of up to 20% (Drebes and Schnepf 
1982), Gyrodinium undulans on Odontella aurita of up to 85% (Drebes and Schnepf 1998). 
Guinardia delicatula and G.flaccida were infected of up to 35% and 65%, respectively, by 
Pirsonia and Cryothecomonas (Tillmann et al. 1999). During abloom of Palmeria hardmania 
in Jamaica, up to 90% of the diatom cells became parasitized by Lagenisma (Grahame 1976). 

In the Schohsee, a mesotrophic lake near Plon (Germany), epidemics of 
chytrids on plankton algae occurred throughout the year. Parasites ap- 
peared at population densities as low as 1 cell ml -1 in some species, with 
infection rates sometimes exceeding 80%. The proportion of the total 
phytoplankton biovolume infected by fungi was usually less than 1% but 
occasionally reached 10%. Parasitism was highly species specific with one 
parasite species usually infecting one host species (Holfeld 1998). Because 
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of the specificity, infection of one algal species favours successional devel- 
opment of other, competing species (Tillmann et al. 1999). 

The dynamics of parasite populations depends on the population of 
possible hosts. Lagenisma coscinodisci was common in the German Bight 
up to the 1980s but is rare now (H. Halliger, List, pers. comm.). The decrease 
coincides with the spreading of a newly introduced Coscinodiscus species, 
C. wailesii. The latter is by far less accepted as host as the other species. 

During a host algal bloom its parasite finds optimal conditions. Follow- 
ing the outbreak of a red tide of the photosynthetic dinoflagellate Gymno- 
dinium mikimotoi (in European waters misidentified as Gyrodinium 
aureolum= Gymnodinium aureolum is now also called Keramia mikimotoi , 
Schnepf and Elbrachter 1999; G. Hansen et al. 2000), abundance and growth 
rates of heterotrophic dinoflagellates increased rapidly. Grazing by 
dinoflagellates effectively contributed to the disappearance of the red tide. 
The ciliate population did not respond (Nakamura et al. 1995). Due to their 
longer generation times multicellular zooplankters can “miss” blooms and 
are less successful in reducing them than protoctists. - The ecological 
impact of algal parasites is influenced also by predators of the parasites and 
by hyperparasites (example of hyperparasitism, see Kuhn and Schnepf 
2002 ). 



3 Kleptochloroplasts 

Kleptochloroplasts (Schnepf et al. 1989) result from an endocytic uptake of 
alien cytoplasm containing chloroplasts which are ultimately retained and 
used as photosynthetic organelles for a restricted period of time. The other 
components of the ingested cell inclusive of the nucleus are usually digested 
much prior to the digestion of the chloroplasts. Kleptoplastidy takes an 
intermediate position between antagonistic and mutualistic, symbiotic 
associations. The enslaved chloroplasts deliver photosynthetic products 
into the feeder which in turn support the chloroplasts to allow their long 
survival. The acquisition of kleptochloroplasts converts a heterotroph into 
a mixotroph. This occurrence is widespread among protoctists (Stoecker 
1998) and has been known for a long time from marine molluscs (opistho- 
branchs) (recently reviewed by Rumpho et al. 2000). 

Dinoflagellates take the kleptochloroplasts generally from cryptophytes. 
Investigations of Gymnodinium aeruginosum (Schnepf et al. 1989) and the 
perhaps conspecific G. acidotum (Farmer and Roberts 1990, Fields and 
Rhodes 1991) as well as of Amphidinium poecilochroum (Larsen 1988) 
revealed the following course of events: 
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The dinoflagellates in question are initially colourless and contain a 
single dinokaryotic nucleus. They can survive in this state for only a 
restricted period of time. Various cryptophytes are ingested myzocytoti- 
cally. Horiguchi and Pienaar (1992) found three types of cryptophytes in 
Amphidinium latum . The naked cryptophycean cytoplasm without plas- 
malemma, periplast and flagellar apparatus is included in a vacuole. The 
algiphorous vacuole is irregular in shape, the chloroplasts are highly lobed. 
After some time, presumably in the range of days, the cryptophycean 
nucleus and the nucleomorph (see Sect. 5.2.2) disappear. Presumably they 
are selectively digested. In wild populations of G. acidotum only 30% 
contained a cryptophycean nucleus (Farmer and Roberts 1990), 10% in G. 
aeruginosum (Schnepf et al. 1989). 

G. acidotum becomes colourless in pure culture 10-14 days after feeding 
on cryptophytes and dies. In co-cultivation with the cryptophyte Chroo- 
monas it has been maintained over 9 months. Starch grains in the dinophy- 
cean cytoplasm indicate that the kleptochloroplasts are functional and that 
carbohydrates are exported into the dinoflagellate. The photosynthetic 
activity of the kleptochloroplasts of Gymnodinium gracilentum is lost 
within a few days (Skovgaard 1998). 

In Amphidinium vigrense cryptophycean-like chloroplasts were found 
to be bounded by three membranes (Wilcox and Wedemayer 1985); it was 
suggested that the chloroplasts had become permanent residents. A. vi- 
grense was, however, described as colourless, feeding on cryptophytes 
(Woloszynska 1925). If chloroplasts are present, their number is highly 
variable. Kleptoplastidy is probable also here. Variable chloroplast num- 
bers are known also from several other dinoflagellates. Kleptoplastidy 
seems to be much more common in dinoflagellates than initially thought. 

Ciliates receive their kleptochloroplasts from a variety of algae. Even a 
single cell may contain a mixture of plastid types (Stoecker 1993, 1998). In 
the Strombidiidae (Oligotricha) plastid retention is common. In marine 
waters about 40% of the ciliate cells in the euphotic zone are “plastidic” 
strombidiids (Stoecker et al. 1987). Mixotrophic ciliates comprised 88% of 
the total ciliate biovolume during summer and fall (Holen 2000). 

Plastid retention is a stable feature in some oligotrich ciliates. Laboea 
strobila is obligately mixotrophic and does not grow in the dark even when 
suitable food is present (Stoecker et al. 1988). Photosynthesis has been 
demonstrated (Stoeker 1993). The half life of Strombidium kleptochloro- 
plasts is between 12 h and some days. 

Kleptochloroplasts in foraminiferans are received only from chromo- 
phytes (Lopez 1979). In centrohelid heliozoans (Patterson and 
Durrschmidt 1987) they are derived from different algae. Chlorophytic and 
chromophytic chloroplasts can be found in one and the same cell, together 
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with digestion vacuoles, either lying free in the heliozoan cytoplasm, sur- 
rounded by only the two membranes of the chloroplast envelope, even in 
the case of chromophytic origin, or additionally by a membrane provided 
by the heliozoon. 

The ciliate Mesodinium rubrum (= Myrionecta rubra) has cryptophyte- 
derived chloroplasts representing a special kind of kleptoplastidy. The 
details of this association are not yet well understood. M. rubrum has 
neither a cytostome nor a cytoproct. It contains numerous vacuoles with 
usually a chloroplast, mitochondria, not always a microbody, but neither 
dictyosomes nor elaborated ER. They are bounded by a single membrane. 
The chloroplast-mitochondrial complexes are in general not connected 
with a portion of endosymbiont cytoplasm containing a cryptophyte nu- 
cleus. Digestion vacuoles have not been found (Hibberd 1977, Lindholm et 
al. 1988). It is believed that M. rubrum is permanently associated with an 
endosymbiotic cryptophyte. 

Only recently has it become possible to cultivate M. rubrum (Gustavson 
et al. 2000). Surprisingly it turned out that, when cultivated unfed for 28 
days, the ciliate became colourless and ceased to grow. When supplied with 
free-living cryptophytes, the biomass of the culture and pigment contents 
increased. The number of ciliates with cryptophyte nuclei increased con- 
siderably within minutes after feeding. Chlorophyll a was synthesized then 
also within the ciliates. Details of food uptake could not be observed. It 
remained unclear whether non-functional chloroplasts and the nuclei are 
egested or digested. 

Mesodinium rubrum may occur in dense blooms. Its non-toxic red tides 
have extremely high rates of primary production, with some of the highest 
values for chlorophyll a and primary production in the marine environ- 
ment (Gustavson et al. 2000) - a convincing example for the efficiency of 
symbiotic cooperation - here at the expense of the cryptophytes. 

Is kleptoplastidy a model for a first step in the evolution of secondary chloroplasts? In 
protoctists the plastids remain active for only some days. In marine slugs (reviewed by 
Rumpho et al. 2000) chloroplasts of mainly siphonaceous green algae, red algae or Vaucheria 
are still functional after months (Vaucheria chloroplasts in Elysia chlorotica: nine months) 
in the absence of any algal nucleo-cytoplasmatic influence. The Vaucheria chloroplasts lie 
“naked” in the Elysia cytosol, surrounded only by two membranes, i. e., without the two 
membranes of the chloroplast ER and a vacuole membrane. Plastid protein synthesis has 
nevertheless been demonstrated. The question whether nuclear encoded proteins are syn- 
thesized in the slug cytosol and targeted to the chloroplasts has not been conclusively 
answered yet. It is also discussed whether the long-term activity of the kleptochloroplasts 
depends on an unusually high level of chloroplast gene autonomy, on extremely stable 
proteins, whether the minimal protein composition needed to support chloroplast activity 
is less than expected, whether mitochondrial or animal encoded proteins with related 
functions are redirected or whether lateral gene transfer has taken place here (Rumpho et 
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al. 2000). It should be added that the isolation of the kleptochloroplasts may abolish the 
“normal” turn-over of chloroplast proteins. 



4 Endosymbiotic Associations with Prokaryotic Algae 

An association between cyanobacteria and the apoplastidic euglenoid flag- 
ellate, Petalomonas sphagnophila, has recently been detected (Schnepf et 
al. 2002). It resembles kleptoplastidy in some respect. P. sphagnophila 
always contains Synechococcus-like cyanobacteria. Their number varies 
between 6 and 20 per cell. They lie individually in perisymbiont vacuoles, 
remain alive for at least several weeks, and are occasionally seen to divide, 
indicating a mutualistic relationship. Cyanobacteria in digestion indicate 
that they directly serve as food, too. The association is specific and seems 
to be a transient, relatively undeveloped form of endosymbiosis. 

P. sphagnophila lives under hypoxic conditions in floating Sphagnum 
mats of bog lakes. The production of oxygen by the symbiont may be as 
important for the host as the supply with carbohydrates or nitrogen com- 
pounds. 

A somewhat similar association has been found by Hargraves (2002). Hermesium adria- 
ticum , an apoplastidic flagellate, has become mixotrophic, harbouring numerous endobiotic 
Synechococcus-\ike cyanobacteria. They are not merely ingested, for dividing cells have been 
observed. 

The heterotrophic dinophysoid dinoflagellates Ornithocercus , Histioneis 
and Citharistes often are associated with Synechococcus or Synechocystis- 
like cyanobacteria, previously described as “phaeosomes” (Lucas 1991; 
Schnepf and Elbrachter 1992). In Citharistes and Histioneis they live ex- 
trathecally in special chambers, in Ornithocercus intracellularly in the 
region of the adcingular collar. In the Gulf of Aqaba they occur mainly in 
the fall, at a time of extended nitrogen limitation (Gordon et al. 1994). It is 
believed that the cyanobacteria supply the dinoflagellates with nitrogen 
compounds and the hosts provide the symbionts with the anaerobic mi- 
croenvironment necessary for efficient nitrogen fixation. 

Associations between endosymbiotic cyanobacteria and hosts which 
have their own chloroplasts likewise use (primarily?) the potential for 
nitrogen fixation of the prokaryotic partner (Floener and Bothe 1980; 
Janson et al. 1995). The filamentous cyanobacterium Richelia intracellu- 
laris is a temporary, intracellular, extrabiotic symbiont in the marine dia- 
toms Hemiaulus spp. and Rhizosolenia spp. (Villareal 1994; Jansson et al. 
1995). The filaments have terminal heterocysts. They may fix a considerable 
amount of nitrogen (Werner 1992). 
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The coccoid cyanobacteria in Rhopalodia are endobiotic. A diatom 
contains between two and five endosymbionts. They have a normal cell wall, 
a somewhat peculiar thylakoid system, lie in perialgal vacuoles and fix 
nitrogen (Floener and Bothe 1980). 

Geosiphon pyriforme is a consortium between prokaryotic alga and 
fungus. It initiated my own research in this field. A recent review of 
Geosiphon is given by Schufiler and Kluge (2001). The host is a coenocytic 
fungus related to the arbuscular mycorrhizal fungi (Schiifiler et al. 1994) 
and is now included in an new phylum, the Glomeromycota (Schiissler et 
al. 2001) - the algal partner is Nostoc punctiforme. Resting spores of the 
fungus do not contain cyanobacteria. When a resting spore germinates and 
the apical region of a hypha meets a Nostoc filament in a special develop- 
mental stage, the “early primordium”, it bulges to surround the alga and 
encloses it excepting heterocysts (Mollenhauer et al. 1966). After a shock 
period the then endobiotic Nostoc grows and forms filaments with hetero- 
cysts. They are enclosed in a single, peripheral compartment, the sym- 
biosome. The symbiosome membrane is a derivative of the host plas- 
malemma. Its inner surface is covered by a rudimentary fungal wall 
(Schiifiler et al. 1996). The Nostoc cells and the heterocysts have normal cell 
walls and a normal internal structure. The endosymbiotic cyanobacteria 
photosynthesize (Kluge et al. 1991) and show nitrogenase activity (Kluge et 
al. 1992). 

Geosiphon houses also endobiotic bacteria. They lie free in the cytoplasm, not in a perisym- 
biont vacuole (Schiifiler et al. 1994). Associations between bacteria and unicellular algae are 
common and reported from nearly all major groups (Chesnick and Cox 1986). Cases of 
parasitism are known (Schnepf et al. 1974). The endobiotic bacteria live in peribacterial 
vacuoles or free in the cytoplasm. The peribacterial membrane of the host is lost then. In a 
Cryptomonas sp. cell only bacteria within peribacterial vacuoles were seen to divide. Free 
bacteria in the same cell contained partly bacteriophages which were subsequently digested 
in autolysosomes (Schnepf and Melkonian 1990). 



5 Endosymbiotic Associations with Eukaryotic Algae 
5.1 Potentially Autonomous Endosymbionts 

The genetically autonomous partners discussed in this chapter can live 
independently, at least for some time. In nature they are metabolically 
interdependent. The algae (phycobionts) belong mainly to the chlorococ- 
cales (zoochlorellae) or to the dinoflagellates (zooxanthellae). The hosts are 
diverse protoctists or invertebrates. The zooxanthellae of marine hydrozoa 
and the zoochlorellae in freshwater hydrozoa, sponges and turbellaria are 
not within the scope of this review (see Werner 1992). 
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The phycobionts are generally enclosed in a special microhabitat, the 
perialgal vacuole or symbiosome. One of the exceptions is Noctiluca. In 
Southeast Asian waters a prasinophyte flagellate, Pedinomonas noctilucae , 
lives in the large central vacuole (Sweeney 1976). 

Diverse amoebae, thecamoebae, heliozoa and, especially, ciliates (holo- 
trichs, spirotrichs and peritrichs) have zoochlorellae (Reisser 1994; Gortz 
1996). The algae remain permanently in their hosts and are distributed to 
the daughter cells when the host cell divides. Both partners of the associa- 
tion between Paramecium bursaria and Chlorella can easily be cultivated 
separately but are in nature found only in symbiosis. A P. bursaria cell 
contains several hundred zoochlorelles. They are individually enclosed in 
a perialgal vacuole, the membrane of which surrounds tight-fitting the alga, 
even during and after cell division. Its structure and function differs from 
the membranes of food vacuoles (Meier et al. 1984). 

Symbiotic Chlorella strains secrete various sugars, some of them mainly 
maltose, much more than non-symbiotic strains. Isolated Chlorella of P. 
bursaria releases up to 90% of the photosynthetically fixed carbon (Reisser 
1994). The secretion is promoted by an acidic environment. During photo- 
synthesis the perialgal vacuole becomes acidified, suggesting that the sugar 
transport to the host is driven by proton antiport (Schiifiler and Schnepf 
1992). The host receives also O 2 from the phycobiont, in exchange for CO 2 
and inorganic ions. P. bursaria reacts phototactically (Niess et al. 1982) but 
not when aposymbiotic. It is a good shepherd, not only because it provides 
an optimal environment for photosynthesis but also because the algae in 
the symbiosomes are protected against the attacks of Chlorella viruses. Free 
in nature they are highly susceptible to viral infections (Reisser 1993). 

P. bursaria depends not exclusively on its zoochlorellae. It feeds phag- 
otrophically on bacteria and algae. In prolonged darkness the zoochlorella 
are also digested, perhaps because sugar secretion has ceased then. If they 
are then re-fed with suitable Chlorella , some of the algae are not passed into 
digestion vacuoles but are enclosed in perisymbiont vacuoles. Reisser 
(1993) suggests that the ciliates recognize suitable algae by carbohydrates 
on the surface of their cell wall which interact zipper-like with lektin-like 
proteins of the phagocytotic membrane to form the perialgal membrane 
that then does not fuse with the membranes of primary lysosomes. 

Zoochlorellae occur also in a few marine protoctists, such as foraminiferans, which may 
house also other algae (Gastrich 1987). Symbiotic chlamydomonal flagellates loose tran- 
siently their flagella, and diatoms do no longer form a frustule (Lee 1983). In culture, after 
their isolation they produce silica shells again, demonstrating that they belong to the pennate 
genera Fragilaria and Nitzschia (Lee et al. 1979). 
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5.2 Reduced Endosymbionts 
5.2.1 Endosymbionts with Nucleus 

Several dinoflagellates contain eukaryotic endosymbionts with chloro- 
plasts (reviewed in Schnepf 1993). The chloroplasts are not lost in perma- 
nent unialgal culture so that kleptoplastidy can be excluded. Mandelli 
(1968) found that Glenodinium (synon. Peridinium, Kryptoperidinium ) 
foliaceum has fucoxanthin as the main carotene instead of the characteristic 
dinoflagellate pigment, peridinin. Dodge (1971) detected that G. foliaceum 
has not only a dinokaryon but also a typical eukaryotic nucleus, belonging 
to a reduced cell with chloroplasts (but see Sect. 5.2.3!). Peridinium bal- 
ticum (Tomas and Cox 1973), Peridinium quinquecorne (Horiguchi and 
Pienaar 1991) and Gymnodinium quadrilobum (Horiguchi and Pienaar 
1994) are similar chimeras. 

The dinoflagellates are quite normal in shape and in their internal 
structure. They have genuine dinoflagellate plastids with a characteristic 
envelope of three membranes (Sect. 6.3.2). These are small and are without 
stacks of thylakoids and chlorophyll and are, therefore, not chloroplasts, 
functioning in photosynthesis. They contain many carotene droplets and 
function as stigmata. In “normal” phototrophic dinoflagellates the stigma 
is likewise a part of the chloroplast. 

The photosynthetically active chloroplasts have thylakoids in stacks of 
three and a girdle lamella. They are surrounded by two pairs of membranes, 
the chloroplast envelope and a peripheral envelope (“chloroplast ER”) (see 
Sect. 6.2). Their structure as well as their pigment composition suggests that 
the endosymbionts are derived from chromophytes (Withers et al. 1977; 
Horiguchi and Pienaar 1994). DNA sequence analyses revealed that the 
ancestor of the endosymbionts of P. balticum and G. foliaceum is a 
Navicula- like pennate diatom (Chesnick et al. 1996, 1997). Dinoflagellate 
and endosymbiont cytoplasms are separated by a single membrane, the 
host membrane because it is likely that the symbiont has been taken up by 
myzocytosis (Sect. 2.4). 

The two nuclei of P. balticum divide synchronously, the endosymbiont 
nucleus amitotically (Tippit and Pickett-Heaps 1976). Its (consequent?) 
variance in appearance resembles a ciliate macronucleus, suggesting that it 
has become genetically incomplete. During sexual reproduction the 
karyogamy of the dinoflagellate is followed by that of the endosymbiont 
(Chesnick and Cox 1989). 

Peridinium and Gymnodinium belong to different orders. It is probable 
that the associations arose, at least in part, from separate endosymbiotic 
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events, while there are also alternative explanations (Morris et al. 1993; 
Horiguchi and Pienaar 1994). 



5.2.2 Endosymbionts with Nucleomorph 

Cryptomonads and chlorarachniophytes are chimeras, with reduced nu- 
clei, “nucleomorphs”, in the photosynthetic endosymbiont. The crypto- 
monad chloroplasts contain phycobiliproteins. They are not assembled in 
phycobilisomes at the outside of the thylakoids but are localized inside the 
thylakoid lumen, in contrast to most other algae with phycobilins (an 
exception is Dinophysis, Sect. 6.4.2). The chloroplasts are surrounded by 
two pairs of membranes. The inner pair is the chloroplast envelope. The 
outer pair is the “chloroplast ER”. Its outer membrane is studded with 
ribosomes and is continuous with the nuclear envelope (Gibbs 1981). 

A “periplastidal cytoplasm” between the two membrane pairs contains 
ribosomes of the eukaryotic type, starch grains, a few vesicles and the 
nucleomorph. This special structure is surrounded by a nuclear envelope- 
like membrane pair with a few pores. It is undoubtedly the vestigial nucleus 
of a reduced eukaryotic endosymbiont (Gillott and Gibbs 1980). It divides 
amitotically just prior to the host cell nucleus (McKerracher and Gibbs 
1982). The nucleomorph contains DNA (Hansmann et al. 1985; Ludwig and 
Gibbs 1985) as well as RNA, a part of which is concentrated in a fibrillo- 
granular region, a structural and functional equivalent of a nucleolus 
(Hansmann 1988). The nucleomorph DNA is organized in three linear, tiny 
chromosomes. They encode a mere 511 genes, 30 of which are for proteins 
required by the plastids (Douglas et al. 2001). 

The nucleomorph and its genes share a common ancestry with the 
nucleus of red algae (Leitsch et al. 1999; Douglas et al. 2001). The starch 
grains in the periplast cytoplasm occur in the same compartment as flo- 
rideen starch. It is generally accepted now that the endosymbiont is a 
reduced red algal cell. The inner membrane of the periplastidal envelope is 
derived from the plasmalemma of the red alga, the outer one represents the 
membrane of the perialgal vacuole which is connected now with the ER of 
the host cell and the nuclear envelope. 

The chlorarachniophytes are a small phylum comprising amoeboid, 
coccoid and flagellate species (Moestrup and Sengco 2001). Chlorarachnion 
reptans forms large plasmodia and is mixotrophic, phagocytizing small 
algae and flagellates. Its chimeric nature has been revealed by Hibberd and 
Norris (1984). Further details of the chlorarachniophyte fine structure have 
been described by Moestrup and Sengco (2001). 
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The chloroplasts contain chlorophyll a and b (Wilhelm et al. 1991). They 
have a stalked pyrenoid but do not contain starch. As in cryptomonads, 
their envelope consists of two membranes and is surrounded by a thin layer 
of periplastidal cytoplasm which is delimited against the host cytoplasm by 
the two membranes of the periplastidal envelope. In contrast to crypto- 
monads the outer membrane of this pair is not studded with ribosomes and 
not connected with the host ER, thus demonstrating more clearly the 
character of a perisymbiont membrane. The pyrenoid is capped by a vesicle 
which is located in the host cytoplasm but in close contact with the periplast 
envelope. It encloses a storage carbohydrate, presumably a (3-1,3-linked 
glucan. 

The periplastidal cytoplasm contains ribosomes of the eukaryote type 
and a nucleomorph resembling a cryptomonad nucleomorph. It divides by 
infolding of the envelope membranes, microtubules are not involved (Lud- 
wig and Gibbs 1989). The DNA is arranged in three tiny, linear chromo- 
somes, as in cryptomonad nucleomorphs, but organization of the genome 
differs considerably (Douglas et al. 2001, McFadden 2001). It contains about 
300 genes packed into a DNA of 30 kb (Gilson and McFadden 1997). During 
the evolution of the chlorarachniophytes genes have been transferred from 
the endosymbiont nucleus into the host nucleus (Deane et al. 2000). 

The pigments as well as the molecular data (Van De Peer et al. 1996) 
indicate that the endosymbiont is a vestigial green alga. The host is related 
to the filose amoebae with Euglypha and Paulinella (Cavalier-Smith and 
Chao 1996/97). Curiously one species of Paulinella has cyanelles (Sect. 
6 . 1 . 1 ). 

5.2.3 Enigmatic Chimeras 

The dinoflagellates Lepidodinium viride (= “strain Y 100”) (Watanabe et al. 
1987, 1990) and Gymnodinium chlorophorum (Schnepf 1993, Elbrachter 
and Schnepf 1996) can be maintained in unialgal cultures. They are similar 
in essential details. The dinoflagellates are apoplastidic and harbour a 
green, vestigial, eukaryotic endosymbiont. The partners are separated by a 
pair of ribosome-free membranes, the plasmalemma of the endosymbiont 
and the symbiosome membrane. Obviously the endosymbiont has been 
taken up by phagocytosis, not by myzocytosis. The chloroplasts have the 
chlorophylls a and b , the carotene prasinoxanthin but lacks chlorophyll c 
and the typical dinophycean carotene peridinin. They are enveloped by two 
membranes. The thylakoids are arranged in stacks of three. A pyrenoid is 
present. Starch grains are found only in the dinoflagellate cytoplasm. 
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Chloroplast ultrastructure and pigmentation suggest that the endosym- 
biont is derived from a prasinophyte, the cytoplasm of which is highly 
reduced. Mitochondria, dictyosomes, an elaborated ER and cytoskeletal 
structures are absent, only abundant ribosomes and a few vesicles found in 
the narrow periplastidal cytoplasm. Some of these “vesicles” are enveloped 
by a pair of membranes which is interrupted by pores and has the appear- 
ance of a nuclear envelope. These “vesicles” thus resemble nucleomorphs. 
In contrast to a typical nucleomorph their interior is loose and homoge- 
nous. It has to be confirmed whether they are indeed relicts of the prasino- 
phyte nucleus and contain DNA, which could not yet be detected. A DNA 
is required for the synthesis of the ribosomal constituents in the vestigial 
prasinophyte cytoplasm. 

It was a great surprise when recently it turned out that there are strains 
of Glenodiniumfoliaceum in which an endosymbiont nucleus could not be 
detected (Kempton et al. 2002). Morphological, biochemical and DNA 
analyses clearly showed that the mononucleate G. foliaceum strain from 
South Carolina is identical with the previously investigated binucleate 
strains and has chloroplasts with the same pigment composition. Further 
mononucleate strains (with a dinokaryon only) have been found in Florida 
and in the Baltic Sea off Hiddensee (Elbrachter, pers. comm.). To make the 
curious situation even more complex: There are strains with dinokaryon 
and endosymbiont nucleus but lack a stigma. Obviously the reduced dino- 
phycean plastid (see Sect. 5.2.1) is lost here. Some of the strains are not yet 
in culture so that kleptoplastidy could be possible, but certain strains have 
been in culture for a long time, in which kleptoplastidy can be excluded 
(Elbrachter, pers. comm.). These unexpected organisms wait to be investi- 
gated further. They demonstrate the incredible plasticity of dinoflagellate- 
endosymbiont associations. 



6 Chloroplasts 

The idea of a symbiotic origin of plastids is generally accepted now. Chlo- 
roplasts evolved from endocytobiotic symbionts, either directly from a 
prokaryotic alga (primary plastids) or from a eukaryotic alga (secondary 
endosymbiosis, complex plastids). 



6.1 Primary Chloroplasts: Two-Membrane Envelopes 

The glaucocystophytes and rhodophytes have chloroplasts with phycobil- 
ins located in phycobilisomes at the outside of the unstacked thylakoids. 
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They store carbohydrates in the form of starch grains free in the cytosol. 
The glaucocystophytes have a vestigial cell wall between the two mem- 
branes of the cyanelle (chloroplast) envelope which is lacking in rhodo- 
phyte chloroplasts. The chloroplasts of chlorophytes/embryophytes con- 
tain chlorophylls a and b, stacked thylakoids and deposit starch grains in 
their matrix. 

In the three taxa with primary plastids apoplastidic species are not 
known though apochlorotic ones can be found in rhodophytes and chloro- 
phytes/embryophytes. The plastids, and in apochlorotic organisms the 
leucoplasts, are the sole site for the synthesis of fatty acids, certain amino 
acids and haem (Howe and Smith 1991). 



6.1.1 Cyanelles 

The glaucocystophytes are a small group of algae that includes flagellated, palmelloid and 
coccoid cells (Bhattacharya and Medlin 1995). The cyanelles as well as the presence of 
peripheral lacunae and a cruciate flagellar root system with multilayered structures charac- 
terize the Glaucocystophytes (Kies and Kremer 1990). The different genera had an uncertain, 
often changing position before (Schnepf et al. 1966; Schnepf and Brown 1971). 

Cyanelles have frequently been considered as models for the symbiogenesis 
of primary chloroplasts. Indeed they have both features of free-living 
cyanaobacteria and cell organelles (McFadden 2001). They retain their 
shape when isolated, due to the presence of a vestigial but rigid cell wall, the 
existence of which has been demonstrated electron microscopically in 
Glaucocystis (Schnepf et al. 1966). It is dissolved by lysozyme (Schenk 1970: 
Cyanophora, Scott et al. 1984: Glaucocystis), like a typical peptidoglycan 
layer. Its synthesis is blocked by penicillin (Kies 1988). The cyanelles of 
Cyanophora have hydrogenases with similar properties as in free-living 
cyanobacteria (Bothe and Floener 1978) but they lack respiration (Floener 
and Bothe 1982) and the enzymes of nitrate assimilation are distributed as 
in cells with chloroplasts (Floener et al. 1982). 

The complete sequence of the Cyanophora paradoxa cyanelle genome is 
now available (Loffelhardt et al. 1997). With about 135 kb it is one order of 
magnitude smaller than that of free-living cyanobacteria and is comparable 
to that of plastids. It comprises 191 genes, including the standard set of 
chloroplast-encoded genes. A certain number of cyanelle genes are absent 
in higher plants. Among them is the gene for the small subunit of ribulose 
bisphosphate carboxylase (rubisco), but also in rhodophytes it is in the 
plastid DNA. There are further similarities between cyanelle and rhodo- 
phyte chloroplast genomes. Curiously, the cyanelle genome codes only for 
one of the proteins needed to synthesize the peptidoglycan wall layer. 
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The most conspicuous feature of the gross organization in the circular 
cyanelle genome is the inverted repeat, typical for chloroplast DNA. Cyan- 
elles are, therefore, true chloroplasts with a vestigial prokaryotic cell wall 
as distinct character. Schenk (1994) consequently uses the term “cy- 
anoplast” instead of “cyanelle”. 

The wall remnant lies between two membranes. The inner one is un- 
doubtedly derived from the plasmalemma of the endosymbiotic cyanobac- 
terium. The nature of the outer one is controversial (Whatley 1993). It was 
initially thought that it represents the membrane of the symbiosome vesicle 
(Schnepf 1966; Schnepf et al. 1966) as in Geosiphon (see Sect. 4). Cavalier- 
Smith (1993) argues on the contrary that it is homologous with the outer 
membrane of the symbiotic cyanobacterium. 

Other controversial issues are the age of the glaucocystophytes and whether they are 
descendants of a link in the monophyletic evolution of primary chloroplasts (Kowallik 1997) 
or are a relatively young group and an analogous model for (polyphyletic) plastid evolution 
(Schenk 1994). Because of the small size of the cyanelle genome and because the partners 
are highly adapted to each other, it is believed that the glaucocystophytes are a relatively old 
taxon (Kies and Kremer 1990). Most molecular biological data suggest a monophyletic origin 
of the three algal lineages with primary chloroplasts (Bhattacharya and Schmidt 1997). 

Quite another organism with cyanelles is the thecamoeba Paulinella chro- 
matophora (Kies 1974). It contains two sausage-shaped cyanelles and is 
related to the chlorarachniophytes (see Sect. 5.2.2). The other species of the 
genus Paulinella are cyanelle-free. The cyanelles have a thin (6-13 nm) 
vestigial cell wall and are lying in perisymbiont vesicles. It is discussed 
whether P. chromatophora and the glaucocystophytes represent analogous 
evolutionary branches, from independent primary endosymbiotic events, 
or whether the cyanelles of P. chromatophora are gained from a secondary 
symbiosis of a glaucocystophyte (Bhattacharya and Medlin 1995; Bhat- 
tacharya et al. 1995). 



6.1.2 Rhodophyte and Chlorophyte Plastids 

The two membranes of the chloroplast envelope of rhodophytes and chlo- 
rophytes/embryophytes have likewise be taken as evidence for the symbio- 
genetic origin of the organelles directly from cyanobacteria, with the inner 
membrane homologous to the plasmalemma of the symbiont and the outer 
one homologous to the membrane of the perisymbiont vesicle, a derivative 
of the phagosomal membrane (Schnepf 1964, 1966; Whatley 1993). The 
chemical composition of the outer membrane differs, however, from that 
of a phagosomal membrane (Joyard et al. 1991) and resembles somehow 
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that of the outer membrane of cyanobacteria and Gram-negative bacteria 
(Cavalier-Smith 1993, 2000). Both prokaryotes are enveloped by two “mem- 
branes” (plus cell wall proper). Their major glycolipids resemble that of the 
two membranes of the plastidal envelope. An exception is phosphatidyl 
choline, present only in the outer leaflet of the outer membrane of the 
plastidal envelope and common in eukaryotic membranes but absent from 
prokaryotic membranes and the inner plastid envelope membranes (Cava- 
lier-Smith 1993; Whatley 1993). It has also to be remembered that a peri- 
symbiont membrane is frequently lost around bacteria (Sect. 4). The trans- 
fer of nuclear gene-encoded proteins into chloroplasts would perhaps be 
simpler if the accepting (outer) organelle membrane is of eukaryotic origin 
and faced as other membranes in the endomembrane system. 

The problem whether primary chloroplasts have a monophyletic or a polyphyletic origin is 
to be resolved by molecular rather than by microscopical methods and is discussed here but 
briefly. The prochlorophytes, prokaryotic algae distinguished from cyanobacteria s. str. by 
the presence of chlorophylls a and b as photosynthetic pigments, the lack of phycobilins and 
the closely stacked thylakoids have initially been considered as a missing link in the genesis 
of green algal chloroplasts (Swift and Palenik 1993). Analyses of various gene sequences 
suggest, however, that none of the known prochlorophytes can be an ancestor of the 
chlorophyte chloroplasts. They even showed that the prochlorophytes are polyphyletic. 

It is generally assumed now that the three lines of primary plastids (in 
glaucocystophytes, rhodophytes and chlorophytes) are of monophyletic 
origin, from a single endosymbiosis of a heterotrophic phagotrophic flag- 
ellate with a cyanobacterium (Melkonian 1996; Delwiche and Palmer 1997; 
Kowallik 1997; McFadden 2001). This opinion is not always accepted (Gue- 
neau et al. 1998). 

A gene transfer from the prokaryote into the eukaryotic nucleus and the 
therewith necessarily connected evolution of protein import mechanisms 
from the cytosol into the plastid (Bodyl 2002) would characterize the first 
step of the transition from symbiont to organelle. 



6.2 Complex Plastids: Four-Membrane Envelopes 
6.2. 1 Plastids of “Golden Algae” 

The chloroplasts of various groups of algae were shown to be additionally 
surrounded by a pair of membranes, the periplastidal ER or chloroplast ER 
(Gibbs 1981) . Here, in the “golden algae” (Medlin et al. 1997), namely the 
photosynthetic heterokontophytes (chromophytes) and haptophytes 
(prymnesiophytes) the vestigial eukaryotic cytoplasm between the perip- 
last ER and the chloroplast, still found in cryptophytes and chlorarachnio- 
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phytes, is further reduced. The chloroplasts contain chlorophylls a and c 
and (3-1,3-glucans as storage polysaccharide, deposited in vacuoles outside 
the chloroplasts. The space between the periplast ER and the plastids 
contains a periplast reticulum, consisting of a few smooth vesicles and 
tubules. Nucleomorphs and ribosomes are lost. Leucoplasts of golden algae 
have a periplast ER and a periplast reticulum, as well (Schnepf 1969b: the 
diatom Nitzschia alba). 

The outer membrane of the periplastidal ER belongs to the host. It is 
studded with ribosomes and is continuous with the nuclear envelope when 
feasible by spatial relationships. The inner membrane of the periplastidal 
ER is interpreted to be likewise a part of the “host” ER. Alternatively it has 
been suggested that the inner membrane is homologous to the plas- 
malemma of the former eukaryotic endosymbiont while the outer one is 
derived from the membrane of the perisymbiont vesicle (Whatley et al. 
1979). 

DNA sequence analyses indicate that the endobiotic ancestors of chro- 
mophyte and haptophyte chloroplasts were red algal-like (Melkonian 1996; 
Kowallik 1997) and arose polyphyletically from unrelated symbiotic events 
(Daugbjerg and Andersen 1997). 



6.2.2 Plastids of Apicomplexans 

It was a great surprise when plastids were detected in apicomplexan para- 
sites like Toxoplasma gondii , Eimeria tenella and Plasmodium falciparum. 
It has been shown recently that they have an envelope of four membranes 
(Kohler et al. 1997) In Psalteriomonas even traces of chlorophylls a and b 
and carotenoids have been detected (Hackstein et al. 1997). The coccidial 
plastid DNA has a length of 35 kb and contains 66 genes in P. falciparum. 
Most of them are involved in transcription and translation. The ribosome 
genes are arranged as inverted repeats, as in other plastids (Hackstein et al. 
1997) 

The apicomplexans (alveolates) could thus be taken as algae with ex- 
tremely reduced plastids. Probably the plastids have been acquired from a 
chlorophyte by secondary endosymbiosis (Kohler et al. 1997). Red algae, 
perhaps via a chromophyte-like organism (McFadden 2001), or dinoflagel- 
lates (Melkonian 1996) are also suspected as plastid source. DNA sequence 
analyses indicate that the apicomplexans evolved from a single, free-living, 
photosynthetic progenitor (Denny et al. 1998; McFadden et al. 1997). 
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6.3 Complex Plastids: Three-Membrane Envelopes 

The photosynthetic euglenophytes and the peridinin-containing dinoflag- 
ellates have chloroplasts with an envelope of three membranes. Cavalier- 
Smith (1993) suggested that the three membranes result from a primary 
endosymbiosis, derived from the phagosomal membrane and the outer and 
inner membrane of a cyanobacterium. It is, however, largely believed now 
that the chloroplasts arose from a secondary endosymbiosis as already 
suggested by Gibbs (1978). The initially four membranes (Sects. 5.2.2 and 
6.2) are reduced. The outermost membrane does not bear ribosomes. It may 
be homologous to the plasmalemma of the eukaryotic symbiont and the 
two inner ones with the envelope of the primary chloroplast while the 
phagosomal membrane is lost (Gibbs 1978). Others have proposed that the 
outer membrane is derived from the phagosomal membrane (Whatley et 
al. 1979). An uptake of the later endosymbiont by myzocytosis would 
explain the presence of only three envelope membranes (Bodyl 2002; Shin 
et al. 2002). 



6.3.1 Euglenophyte Chloroplasts 

Only about 30% of the euglenoids have plastids (Whatley 1993). The others 
are obligate heterotrophs. The euglenophyte chloroplasts contain the chlo- 
rophylls a and b , like chlorophyte chloroplasts. The spectrum of the xan- 
thophylls differs from that of chlorophytes. The thylakoids are in stacks of 
three or two. The storage carbohydrate is paramylum, a P-1,3 glucan, 
deposited in the form of crystalline granules in vacuoles. 

The euglenoid plastid DNA has a peculiarity. The genes for rRNA are 
arranged in tandem, not in inverted repeats as in other plastids. The 
genome organization and its gene content is otherwise similar to that of 
chlorophyte chloroplasts (Hallick et al. 1993), suggesting that a chlorophyte 
was the ancestor of the euglenophyte chloroplasts (Bhattacharya and 
Medlin 1998). 

The euglenoid “hosts” are a sister group of the kinetoplastids (Triemer and Farmer 1991), 
a group of apoplastidic protoctists. The comparison of kinetoplastid (bodonid) cytostomes, 
colourless euglenoid cytostomes and the reservoir pockets of photosynthetic euglenophytes 
suggests that the photosynthetic euglenophytes arose from phagotrophic or myzocytotic 
ancestors (Shin et al. 2002). 
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6.3.2 Dinoflagellate Chloroplasts with Peridinin 

The dinoflagellates are most diverse with respect to their plastids. Only 50% 
are photosynthetic. Many families and even genera contain taxa with or 
without chloroplasts or with different types of plastids (Schnepf and El- 
brachter 1999; G. Hansen 2001). Even parasitic dinoflagellates may have 
chloroplasts. The “typical” dinoflagellate chloroplasts are characterized by 
chlorophylls a and C2 and the carotenoid peridinin. The other types are 
discussed in Sect. 6.4. Only the peridinin-chloroplasts have a three-mem- 
brane envelope. The thylakoids are arranged in stacks of three, occasionally 
of four or two. The storage carbohydrates are deposited as starch grains 
within the cytosol. 

The outermost membrane is ribosome-free. The middle membrane 
shows sometimes an unusual structure in thin sections (Schnepf 1993: 
Prorocentrum micans) as well as after freeze-fracturing (Sweeney 1981: 
Gonyaulax polyedra). In Prorocentrum micans and some other dinoflagel- 
lates, mostly Prorocentrales, it is even locally absent, especially close to the 
pyrenoid (Schnepf and Elbrachter 1999). The question consequently arises 
whether it is actually a “biomembrane” in the narrow sense. In uninjured 
cells each true biomembrane surrounds a compartment completely, with- 
out gap. The molecular architecture of biomembranes does hardly allow 
free edges. 

Cavalier-Smith (1993) discussed whether the peridinin chloroplasts are 
directly derived from cyanobacteria. The middle membrane would then be 
homologous with the outer membrane of the prokaryote and represent an 
example of membrane loss, an indication that the outer membrane of 
Gram-negative bacteria and cyanobacteria are not “true” biomembranes, 
but membrane-like cell wall layers. This interpretation is suppported by 
their “aberrant” position (they do not separate a cytoplasmic from a non- 
cytoplasmic compartment, see Introduction) and the lack of active trans- 
port processes. 

The original idea of Gibbs (1978) that the peridinin chloroplasts are the 
result of a secondary endosymbiosis is favoured now (Medlin et al. 
1997). The endosymbiont was perhaps red algal-like (Melkonian 1996; 
Zhang et al. 1999; McFadden 2001). If the ingestion was by myzocytosis the 
outer envelope membrane would represent the phagosomal membrane, the 
two inner ones the chloroplast membrane. Hypotheses about protein im- 
port into the chloroplast have to consider the presence or absence of the 
middle membrane. It might be extremely permeable. 

The DNA of peridinin chloroplasts is unique in consisting of minicircles each containing a 
single gene (Zhang et al. 1999; Barbrook and Howe 2000). Another peculiarity concerns 
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rubisco. All organisms with oxygenic photosynthesis have form-I rubisco, with eight cata- 
lytic large and eight small subunits, but peridinin dinoflagellates have form-II rubisco with 
only two large subunits, an enzyme which was previously known only from certain anaerobic 
proteobacteria such as Rhodospirillum rubrum (Morse et al. 1995; Rowan et al. 1996). In 
contrast to form-I rubisco the dinoflagellate rubisco is encoded as a polyprotein by nuclear 
genes. Palmer (1995) discussed several scenarios how rubisco II genes might be incorporated 
into the dinoflagellate nucleus. 



6.4 Aberrant Dinoflagellate Chloroplasts 

Aberrant dinoflagellate chloroplasts have either fucoxanthin or fucoxan- 
thin derivatives instead of peridinin as main carotenoid or they contain 
phycobilins (reviewed by Schnepf and Elbrachter 1999). 



6.4.1 Fucoxanthin Chloroplasts 

The most extensively investigated dinoflagellate with fucoxanthin-chloro- 
plasts is Gymnodinium mikimotoi. The characteristic carotenoid is 18 J -hex- 
anoylfucoxanthin (Tangen and Bjornland 1981), a pigment known other- 
wise only from Emiliana huxlei and some other prymnesiophytes. The G. 
mikimotoi chloroplasts have thylakoids in stacks of three and a prominent 
stalked pyrenoid. It is thylakoid-free and connected with the chloroplast by 
a small bridge. The chloroplast envelope is not well enough preserved in 
electron microscopical studies (Kite and Dodge 1985; Schnepf and El- 
brachter 1999) to state definitively the number of membranes. Probably it 
consists of two membranes, and a chloroplast ER, as present in Emiliana , 
is lacking here. The further organelle-cytoplasm interface is likewise not yet 
fully explored. G. mikimotoi is grown in unialgal culture. Kleptoplastidy is 
thus excluded. One may speculate that the chloroplasts are derived from a 
prasinophyte, taken up by myzocytosis. The periplastidal ER and other 
prasinophyte cell components have apparently been lost subsequently. 



6.4.2 Phycobilin-Chloroplasts 

Phycobilin- containing chloroplasts are known from Dinophysis and some 
related dinoflagellates. There are also apoplastidic Dinophysis species (Hal- 
legraeff and Lucas 1988; Schnepf and Elbrachter 1988). Some of the photo- 
synthetic species are mixo trophic and feed by myzocytosis (Jacobson and 
Andersen 1994). 
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The chloroplasts have thylakoids in stacks of two. Their lumen is rela- 
tively wide and filled with an electron-dense material. They resemble highly 
the thylakoids of cryptophytes (Schnepf and Elbrachter 1988) and contain 
likewise phycoerythrin in their lumen (Vesk et al. 1996). The major carote- 
noid is alloxanthin, as in cryptophytes. In contrast to previous statements 
peridinin is absent (Meyer-Harms and Pollehne 1998) as supposed earlier 
(Vesk et al. 1996). The chloroplast envelope consists of two membranes 
(Schnepf and Elbrachter 1988). Starch grains are deposited in the cyto- 
plasm, as in other dinoflagellates. 

Photosynthetic Dinophysis species could as yet not be grown in culture. 
The idea of Melkonian (1996) that the chloroplasts may be kleptochloro- 
plasts is nevertheless unlikely because chloroplast-free cells have never 
been found in photosynthetic species. Schnepf and Elbrachter (1988) sug- 
gested that the chloroplasts were derived from a cryptophyte, thus repre- 
senting a tertiary endosymbiosis, with subsequent loss of cryptophyte 
membranes and cytoplasm. Cavalier-Smith (1993) assumed on the contrary 
that Dinophysis chloroplasts result from a primary symbiosis directly from 
a cyanobacterium and represent an early stage of dinophyte chloroplast 
evolution. His idea was based on the wrong assumption that Dinophysis 
chloroplasts contain phycobilins as well as peridinin. Molecular studies 
indicate moreover that the Dinophysales are not primitive dinoflagellates 
but a sister group of the Prorocentrales and that the photosynthetic species 
have evolved rather recently (Rehnstam-Holm et al. 2002). 



7 Conclusions and Perspectives 

This review was initiated by the question of the general validity of the £C rules 
of cell compartmentation” (Sect. 1), especially for associations between 
protoctists and microalgae. It was shown that there are, indeed, some 
exceptions (Sects. 2.1.1, 4, 5.2.1). The host plasmalemma-derived mem- 
brane around endobiotic parasites and symbionts is lost sometimes. A 
myzocytotic uptake of host cytoplasm results likewise in a single mem- 
brane, a host membrane, between host/predator and symbiont/prey (Sect. 
2.4). In most parasitic and symbiotic associations the rules are not violated 
(Sects. 2, 3, 4, 5) In a variety of associations the interface between the two 
partners is not yet clearly explored (Sects. 2.1.1, 2.7, 5.1). 

The further search for protoctists feeding on or symbiotic with algae will 
result in new insights in their structural and cell biological relationships as 
well as in the biological role of those associations which has been underes- 
timated for a long time. They will furthermore give new hints on the 
symbiogenesis of plastids. 
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The endosymbiotic origin of plastids from a previously free-living cy- 
anobacterium, first deduced from structural evidence (Sect. 1) is well estab- 
lished by analyses of plastid DNA phylogeny. It is generally accepted now 
that the plastids with two envelope membranes trace back to a single 
endosymbiosis with a heterotrophic, phagotrophic flagellate as host (Sect. 
6.1). Glaucocystophytes, rhodophytes and chlorophytes/embryophytes are 
thus sister groups, the glaucophytes being the least evolutionarily different 
from cyanobacteria. Further research on DNA phylogeny will refine our 
image of the evolutionary scenario and will perhaps present further sur- 
prises as was the detection that in rhodophytes (and in algae with secon- 
darily gained rhodophyte chloroplasts) the genes encoding rubisco are not 
of cyanobacterial but proteobacterial provenance (Delwiche and Palmer 
1997). 

Biochemical and molecular biological studies will perhaps resolve the 
controversy around the homology of the outer membrane of the plastid 
envelope (Sect. 6.1.2). Is it derived from the host plasmalemma (via the 
membrane of the symbiosome) or does it represent the “outer membrane” 
of the symbiotic cyanobacterium? In this context it is necessary to discrimi- 
nate between biomembranes s. str. and membrane-like cell wall layers 
which do not participate in active transport processes. 

There is also general agreement about the origin of plastids with an 
envelope of two membrane pairs from a secondary endosymbiosis between 
(unrelated) eukaryotic hosts and red or green algae, respectively (Sect. 6.3). 
The algae with nucleomorphs represent stages on the way from endosym- 
biont to organelle (Sect. 5.2.2). Not yet fully clear is the number of secondary 
symbioses having led to complex chloroplasts. 

An interesting idea about the evolutionary advantages of secondary 
chloroplasts over primary ones was recently proposed (Lee and Kugrens 
2000). The latter authors noticed that the (successful) secondary symbioses 
arose before 275 Ma, at a time when the atmospheric CO2 level had de- 
creased drastically. An acidification of the perisymbiotic space, which is 
homologous to a digestion vacuole, would remarkably improve the supply 
of rubisco with CO2. In the alkaline sea water inorganic carbon is dissolved 
in the form of HCO3 - which is not accepted by rubisco. Back then, but not 
before that time, algae with secondary plastids would have had a selective 
advantage over other existing algae, as perhaps in the case of kleptochloro- 
plasts (Sect. 3). 

This discussion includes also the peridinin-chloroplasts of dinoflagel- 
lates and euglenophyte chloroplasts, both with three-membrane envelopes 
(Lee and Kugrens 2000). 

They are most probably the result of secondary endosymbioses though 
an origin via primary symbioses is also discussed (Sect. 6.3). It is perhaps 
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important to note in this context that some dinoflagellates and a few 
euglenoids ingest host cytoplasm by myzocytosis, leaving out the plas- 
malemma (Sect. 2.4), and that both groups comprise many apoplastidic 
species. While some heterokont taxa, e.g., diatoms (Sect. 6.2.1), comprise 
apochlorotic but no apoplastic species, many larger taxa of heterokonts, 
e.g., the oomycetes, are apoplastidic. Secondary chloroplasts seem to be less 
firmly integrated into the host than primary ones which are never lost but 
transformed into leucoplasts in non-photosynthetic species. 

Were the ancestors of apoplastidic heterokonts, dinoflagellates, 
euglenoids and algae with secondary chloroplasts originally phagotrophic 
mixotrophs with primary chloroplasts which were then lost? If so, the 
incorporation of a secondary endosymbiont would be facilitated by the 
presence of genes for photosynthetic activities already transferred into the 
nucleus, and of protein import mechanisms (Hauber et al. 1994). A coexis- 
tence of two different types of photosynthetic organelles would, however, 
result in competition which is less likely and not represented by living 
examples. Endosymbiotic cyanobacteria in cells with chloroplasts fix pri- 
marily nitrogen (Sect. 4). 

The photosynthetic endosymbiont of Glenodinium foliaceum and simi- 
lar dinoflagellates coexist with a genuine achlorophyllous plastid which is 
reduced to an eyespot (Sect. 5.2.1). The dinoflagellates with their different 
types of secondary and tertiary chloroplasts and their different strategies 
of food uptake (Sects. 2.2, 2.3, 3, 4, 5.1, 5.2.1, 5.2.3, 6.3, 6.4) comprise some 
not yet fully understood symbiotic associations (Sect. 5.2.3) and will surely 
present further surprising examples. Microscopical studies of extraordi- 
nary algae are especially useful for understanding the symbiogenetic origin 
of chloroplasts. In the extremes the borders of general laws can be traced 
better than in “normal” organisms. The main progress will come by mo- 
lecular phylogenetic analyses, the analyses of lateral gene transfer and of 
the mechanisms of protein import into the organelles, considering klepto- 
chloroplasts, as well. 

Many of my own studies have been carried out at Wattenmeerstation List/Sylt, Biologische 
Anstalt Helgoland (now Alfred Wegener Institute). I gratefully acknowledge the fruitful 
cooporation with Dr. G. Drebes, Dr. M. Elbrachter, H. Halliger, G. Deichgraber and Dr. S. 
Kuhn. Dr. M. Elbrachter and Prof. Dr. P. Sitte provided valuable information and helped to 
condense the text. 
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Recombination: 

Implications of Single Nucleotide Polymorphisms 
for Plant Breeding 

Gisela Neuhaus and Renate Horn 



1 Introduction 

Single-nucleotide polymorphisms (SNPs) are the most abundant type of 
DNA sequence variation in humans, animals and plants (Landegren et al. 
1998; Schafer and Hawkins 1998; Wang et al. 1998; Gilles et al. 1999; See et 
al. 2002). SNPs, which are in the narrow sense single-base substitutions 
(transitions or transversions) in a DNA sequence, as well as small insertions 
and deletions (InDels) represent the most promising markers for plant 
breeding, so far. Although SNPs are biallelic, large numbers can be com- 
bined to form haplotypes making SNPs highly informative in comparison 
to other types of molecular markers such as RFLPs or SSRs (Meyer et al. 
2000; Rafalski 2002a). Recent advances in SNP technologies, especially in 
the field of high-throughput applications (Gupta et al. 2001, Jander et al. 
2002), allow SNP scoring and screening in plants on a large scale or even 
genome-wide (Buckler and Thornsberry 2002; Rafalski 2002a, b; Cho et al. 
1999). 

For plant breeding, SNPs are interesting for a wide range of applications, 
like e.g. estimation of genetic diversity or determination of allele distribu- 
tions and frequencies within a large number of genotypes. Association 
mapping with SNPs based on candidate genes or genome-wide approaches 
also offers the possibility to resolve complex traits and the involved genes. 
Here, we will give an overview of SNP technologies and applications and 
will highlight potential future uses of SNPs in plant breeding. 



2 Single Nucleotide Polymorphisms in Higher Plants 

2.1 Identification of Single Nucleotide Polymorphisms 

Several strategies have been employed to discover SNPs. Sequence data 
from direct sequencing or after cloning amplicons or fragments represent 
the major source for detecting SNPs. In small, trait-oriented approaches, 



Progress in Botany, Vol. 65 
© Springer- Verlag Berlin Heidelberg 2004 




56 



Genetics 



sequences of candidate genes or markers linked to traits of interest have 
been successfully used to identify SNPs. 

AFLP markers all linked to one of two loci, rhgl and Rhg4> which confer resistance to soybean 
cyst nematode, have proven to be a good source in soybean to develop SNP-based markers 
(Meksem et al. 2001a). In maize, DNA sequence data were used from coding regions and 
flanking sequences of four genes, indeterminate 1 ( idl ), teosintebranchedl(tbl), dwarf8 (d8) 
and dwarf3(d3), to identify SNPs (Remington et al. 2001). To determine the genetic diversity 
based on SNPs, six key genes involved in starch production in maize: amylose extender 1 
(ael), brittle2 (bt2)> shrunkenl ( shl ), shrunken2 (sh2) y sugary 1 (si/I), and waxyl (wxl) were 
analysed for both coding and non-coding regions in 30 maize inbred lines (Whitt et al. 2002). 
Also, in maize, resistance gene analogues (RGA) mapped in relation to sugarcane mosaic 
virus (SCMV) resistance genes ( Scmvl and Scmv2) were used to discover SNPs, which 
allowed mapping of the RGA (Quint et al. 2002). In barley, cytochrome P450 genes extracted 
from the International Triticeae EST Cooperative database were used to identify SNPs in 
barley varieties (Bundock et al. 2003). 

For genome-wide discovery of SNPs, extensive databases are available that 
allow electronic SNP (eSNP) discovery in shotgun genomic libraries, and in 
expressed sequence tag (EST) libraries. 

Tools for single nucleotide polymorphism detection, like e.g. Polybayes (http://genome. 
wustl.edu/groups/informatics/software/polybayes) or SNPFINDER (http://www.ktl.fi/ 
bioinfo/maps.html) are available for public research in the Internet. These programmes 
allow one to cluster overlapping ESTs, filter possible pseudogenes or paralogues, and check 
trace files, which otherwise must be done by hand. Brett et al. (2000) recommend following 
three rules to locate SNPs with high accuracy: ESTs should be aligned using BLASTN with 
an ‘expected value’of le-30. Then, only ESTs with over 95% identity over 100 bp should be 
considered to avoid processing pseudogenes and close homologues. In addition, the at least 
two different ESTs rule’ should be employed, meaning that in a case where several ESTs 
match a sequence, at least two of them must have the same nucleotide change that is different 
from the majority of ESTs matching the sequence at that point. 

Current genome sequencing projects in rice, maize, wheat and other crop 
species are providing increasing amounts of sequencing data, which will 
accelerate SNP discovery by comparative alignment using databases. 



2.2 Distribution of SNPs in Plant Genomes 

In plant species, sequence variation is widespread (Table 1). lrvArabidopsis> 
37,344 SNPs and 18,579 small InDels (http://www.arabidopsis.org/cereon; 
Jander et al. 2002) have been identified. In rice, investigations of a 2.3-Mb 
homologous region between the two subspecies japonica and indica iden- 
tified 9,056 SNPs in each and 63 InDels in indica and 138 in japonica (Han 
and Xue 2003). These data demonstrate that SNPs can provide a much 
larger number of markers for plant breeding than any other known marker 
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Table 1. Distribution of single nucleotide polymorphisms (SNPs) in higher plants. Frequen- 
cies of SNPs per base pair (bp) are given based on sequence analysis. If data were available, 
frequencies are separately presented for coding and non-coding regions, as SNPs seem to 
be more abundant in non-coding regions of most plants 


Plant species 


Population type 


Frequency 


Reference 


Arabidopsis thaliana 


Ecotypes 
Coding regions 
Non-coding regions 


1 SNP/3,100 bp 
1 SNP/2,200 bp 


Jander et al. (2002) 


Beta vulgaris 


Breeding lines 


1 SNP/130 bp 


Schneider et al. (2001) 


Glycine max 


Various genotypes 
Coding regions 
Non-coding regions 


1 SNP/610 bp 
1 SNP/206 bp 


Zhu et al. (2001) 


Hordeum vulgare 


Barley varieties 


1 SNP/189 bp 


Kanazin et al. (2002) 




Barley lines 
(cytochrome 
P450 genes) 


1 SNP/131 bp 


Bundock et al. (2003) 




Barley varieties 


1 SNP/58 bp 


Neuhaus et al. (2003) 


Oryza sativa 


Oryza sativa ssp. 
japonica 
(three cultivars) 
Oryza sativa ssp. 
indica 

(two cultivars) 
Oryza rufipogon 


1 SNP/89 bp 


Nasu et al. (2002) 




Oryza sativa 
accessions 
Oryza sativa ssp. 
japonica 
Coding regions 
Non-coding regions 
Oryza sativa ssp. 
indica 

Coding regions 
Non-coding regions 


1 SNP/100 bp 

1 SNP/304 bp 
1 SNP/272 bp 

1 SNP/379 bp 
1 SNP/315 bp 


Garris et al. (2003) 
Han and Xue (2003) 


Saccharum 

officinarum 


Coding region 


1 SNP/122 bp 


Grivet et al. (2003) 


Zea mays 


Maize exotic land- 
races and inbred lines 
Maize inbred lines: 
Coding regions 
Non-coding region 
Maize inbred lines: 
Resistance gene 
analogs 

Maize inbred lines: 
3’Untranslated 
region 

Coding regions 


1 SNP/104 bp 

1 SNP/124bp 
1 SNP 31 bp 

1 SNP / 33 bp 

1 SNP / 48 bp 
1 SNP/ 130 bp 


Tenaillon et al. (2001) 
Ching et al. (2002) 

Quint et al. (2002) 

Bhattramakki 
et al. (2002) 
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system, so far. In theory, several SNPs should be detectable within each gene 
of interest and should allow allele differentiation. The lack of polymor- 
phisms linked to valuable traits had been a limiting factor in marker-as- 
sisted breeding, which can be overcome by using SNPs as markers. 



3 Overview of Different Methods for SNP Detection 
in Plant Breeding 

Especially, in marker-assisted plant breeding, where thousands of geno- 
types have to be screened every year, novel and suitable assays must be fast, 
inexpensive, multiplexable and amenable to automatisation to allow high 
throughput (Bormans et al. 2002). Recent advances in SNP technologies 
provide screening methods that fulfil these criteria. Table 2 gives an over- 
view of SNP technologies and their applications in plants. 

Table 2. Mutation analysis in crop cultivars by different SNP detection methods. (BESS-T/G, 
base excision sequence scanning; CAPS, cleaved-amplified polymorphic sequence; DHPLC, 
denaturing high-performance liquid chromatography; EST, expressed sequence tag, 
MALDI-ToF MS, matrix assisted laser desorption/ionisation time-of-flight mass spectrome- 
try: PCR, polymerase chain reaction; SnuPE, single nucleotide primer extension; SSCP, 
single-strand conformation polymorphism; SSR. simple sequence repeats) 



Crop cultivar/ 
plants 


SNP detection 
method 


Application 


Reference 


Alopecurus 

myosuroides 

Huds. 


Sequencing, allele- 
specific PCR 


Genotyping of resistance 
to graminicidal herbicides 


Delye et al. (2002) 


Arabidopsis 

ecotypes 


Heteroduplex 

analysis 


Generation of co-dominant 
PCR-based markers 


Hauser et al. (1998) 




Oligonucleotide 

microarray 


Positional Cloning 


Cho et al. (1999); 
Spiegelmann 
et al. (2000) 


Arachis 
hypogae L., 
wild species 


Sequencing 


Phylogenetic relationship 


Jung et al. (2003) 


Beta vulgaris 


SSCP 


Segregation analysis of 
functional gene homologues 


Schneider et al. 
(1999, 2001) 


Eucalyptus 
urophylla , 
E. grandis 


SSCP 


Characterisation of QTL 
for wood quality 


Gion et al. (2000) 


Glycine max 


TaqMan™ 


SNP screening 
of inbred lines 


Meksem et al. 
(2001b) 
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Table 2. Continued 



Crop cultivar/ 
plants 


SNP detection 
method 


Application 


Reference 


Hordeum 
vulgare L. 


DHPLC 


Genome-wide 
SNP analysis 


Kota et al. (2001) 




BESS-T/G 


Resistance against barley 
yellow mosaic virus 


Neuhaus et al. (2003) 




SNuPE/MALDI- 
ToF MS 


SNP typing for mlo 
powdery mildew resistance 


Paris et al. (2001) 




Sequencing 


Genome-wide haplotypes 


Kanazin et al. (2002) 


Oryza sativa 


Sequencing 


Differentiation of amylose 
content in cultivars 


Ayres et al. (1997) 




READIT™ 


SNP genotyping of granule 
bound starch synthase 


Bormans et al. 
(2002) 


Saccharum 
officinaruniy 
S. spontaneum 


Sequencing, 
Alignment of EST 
sequences 


SNP screening in the Adh 
gene family 


Grivet et al. (2003) 


Solanum 

tubersum 


Pyrosequencing 


SNP genotyping in 
polyploid species 


Rickert et al. (2002) 


Zea mays L. 


Sequencing 


Genetic diversity 


Tenaillon et al. 
(2001) 




EST-Mapping/ 
Pyrosequencing ™ 


SNP screening of three 
different loci 


Ching and 
Rafalski (2002) 




Hybridization 

assay 


Genotyping of different 
inbred lines 


Mogg et al. (2002) 




CAPS 


Resistance gene analogs 


Quint et al. (2002) 




SNuPE assay 


Genotyping SNPs in the 
flanking regions of SSRs 


Batley et al. (2003) 



3.1 SNP Detection Methods for Pre-Screening 

Technologies to identify unknown SNPs are mostly gel-based methods 
characterised by a moderate throughput but relatively low start-up costs 
(Jander et al. 2002). Pre-screening of amplicons for SNPs prior to verifica- 
tion of a SNP by sequence analyses proved to be most economic for 
development of SNP assays. 

Single-Strand Conformation Polymorphism (SSCP) analysis and Tem- 
perature Gradient Gel Electrophoresis (TGGE) represent simple and inex- 
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pensive techniques (White et al. 1992) based on the principle that an altered 
DNA conformation, caused by a single point mutation, affects the mobility 
of these fragments during electrophoretic separation. Modified sequencing 
techniques, such as Base Excision Sequence Scanning (BESS-T/G sequenc- 
ing), are also available tools for SNP pre-screening (Hawkins and Hoffman 
1997; Karp et al. 1997; Yu et al. 2001). 

These gel-based, technically simple and well-established assays are the 
most suitable tools for pre-screening of a number of germplasms or geno- 
types to get an indication of SNP occurrences or frequencies within un- 
known sequences. However, only short fragments ranging from 1 00-400 bp 
can be analysed for SNP frequencies by these technologies and the capabil- 
ity for multiplex assays is limited. 



3.2 Present Technologies for Allelic Selection in Plant Breeding 

Allele differentiation including discrimination of homozygotes and het- 
erozygotes is essential for marker-assisted selection in plant breeding. 
Various methods based on SNPs have been developed, which allow distinc- 
tion of different alleles. 

Markers identifying alleles based on SNPs that alter restriction sites are 
called Cleaved-Amplified Polymorphic Sequence (CAPS) markers 
(Konieczny and Ausubel 1993). After PCR amplification, PCR products are 
cleaved with the appropriate restriction enzyme, and separated on agarose 
gels. If no useful restriction site is affected, allele-specific primers contain- 
ing one of the bases of the SNP can be designed to genotype respective 
polymorphisms. However, allele-specific PCR requires sequence informa- 
tion regarding the SNP and a detailed optimisation of the amplification 
conditions is necessary to exclude false-positives. In a modified allele-spe- 
cific amplification procedure (Single Nucleotide Amplified Polymorphism 
- SNAP), one additional mismatch, apart from the one separating the 
alleles, is introduced, which considerably enhances the selectivity of the 
primers and thereby maximizes the efficiency to differentiate alleles of the 
SNP (Drenkard et al. 2000). 

A newly developed assay for SNP genotyping is the READIT system. In 
the presence of a high concentration of pyrophosphate, the DNA polym- 
erase catalyses depolymerisation of the template DNA when there is a 
perfect match at the 3’end between the probe and the target sequence, while 
the mismatched probe is not depolymerised. Released dNTPs are converted 
enzymatically to ATP, which activates a luciferin/luciferase detection sys- 
tem. Using a probe specific for each allele, homozygotes and heterozygotes 
can be distinguished. The TaqMan assay also uses allele-specific oligonu- 
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cleotides, but conjugated with a pair of fluorophores. In an intact unhy- 
bridised TaqMan probe, the fluorescence is quenched by fluorescence 
resonance energy transfer (FRET), where upon cleavage of the hybridised 
T aqMan probe by 5’exonuclease activity of T aq polymerase the reporter dye 
is separated from the quencher. The TaqMan-assay is designed to detect 
SNPs in a high-throughput manner. By using more than one reporter dye, 
it is also possible to detect different alleles of a SNP in a single reaction. 
Alternative approaches for screening single sequence variations are assays 
based on single nucleotide primer extension (SNuPE), including mini-se- 
quencing approaches that are well established techniques for screening 
polymorphisms (Pastinen et al. 1997; Syvanen 1999; Shapero et al. 2001). 

A large number of technologies based on hybridisation and primer 
extension assays has been developed to genotype SNPs (Jander et al. 2002). 
These types of assays characterised by a high potential for automatisation 
and increased throughput, are rapid and sensitive systems and therefore 
promising technologies in future plant breeding, but often require special 
equipment, which might only be affordable for big breeding companies. 



3.3 Advanced Technologies for SNP-Screening 

SNP analysis becomes most powerful if used in genome-wide approaches 
and on a high-throughput level. For large-scale screening of SNPs, methods 
based on primer extension as described above can be combined with highly 
automated techniques such as Matrix Assisted Laser Desorption/Ionisation 
Time-of-Flight Mass Spectrometry (MALDI-ToF MS) that analyses exact 
physical characteristics, such as molecular mass of a nucleotide (Kim et al. 
2002). High-throughput analysis can also be performed by using DHPLC, 
which allows the detection of SNPs through different retention time of 
heteroduplex and homoduplex DNA in reverse-phase HPLC under partially 
denaturing conditions (Spiegelman et al. 2000). Pyrosequencing (Ronaghi 
et al. 1998; Ahmadian et al. 2000; Alderborn et al. 2000), which utilizes an 
enzyme cascade system to analyse large numbers of DNA sequences and to 
determine allele frequencies, can be automated as well. A luminometric 
detection system is used to measure the pyrophosphate that is released 
upon nucleotide incorporation. Because 20 or more nucleotides are deter- 
mined by this method, it is possible to detect several closely linked SNPs at 
once. 

Microarrays offer the highest potential for large-scale detection of poly- 
morphisms and for high throughput (Schafer and Hawkins 1998; Aharoni 
and Vorst 2001). DNA microarrays are currently produced and assayed by 
two main approaches involving either in situ synthesis of oligonucleotides 
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(‘oligonucleotide microarrays’) or deposition of pre-amplified DNA frag- 
ments (‘cDNA microarrays’) on solid surfaces (reviewed by Aharoni and 
Vorst 2001). 

A wide field of applications in plant breeding using these novel, non-gel 
based screening methods is imaginable in the future for determination of 
genetic diversity in higher plants, especially crops, or gene discovery and 
expression analysis (Aharoni and Vorst 2001). Especially, the development 
of high density arrays enabling a fast and precise detection of many markers 
at the same time (multiplex application) will provide great possibilities for 
effective large-scale genotyping or genome-wide screening of genotypes. 



4 Application of Single Nucleotide Polymorphisms 
4.1 Assessment of Genetic Diversity by SNPs 

Nucleotide diversity is the average number of nucleotide differences per 
site between two sequences. A number of factors, e.g. mutation rate, popu- 
lation size, mating type, balancing selection, background selection, popu- 
lation structure, have an impact on nucleotide diversity (Buckler and 
Thornsberry 2002). In maize (Zea mays ssp. mays), nucleotide diversity at 
silent sites averages 1.6% for genes that appear to behave neutrally, while 
the diversity in maize’s wild relative Z. mays ssp. parviglumis is about 2% 
(White and Doebley 1999; Gaut et al. 2000). At individual loci, the diversity 
ranged from 0.2 to 5% (White and Doebley 1999). There has been roughly 
a drop of 30% in diversity at the average locus from maize’s relatives. 
Compared to other grass species, maize and its wild relatives seem to have 
a high level of genetic diversity, which is probably the result of high levels 
of outcrossing (Buckler et al. 2001). 

Whitt et al. (2002) analysed six key genes, amylose extender 1 (ael) y brittle2 (bt2) y shrunkenl 
(shl) y shrunken2 ( sh2) y sugary 1 ( sul ), and waxyl (wxl) y known to play a major role in starch 
production in maize. Although the starch loci showed a wide range in diversity, average 
diversity in the starch loci was 2.3-fold lower than in 29 random maize loci at silent sites 
(Whitt et al. 2002), and 4.8-fold lower at non-synonymous sites (Tenaillon et al. 2001). For 
chromosome I in maize, Tenaillon et al. (2001) related nucleotide diversity to chromosome 
structure, recombination and various types of selection. Population effects in the production 
of the breeding material in maize are also discussed. 

In general, the domesticated relatives of the grasses (maize, sorghum, rice, 
oats, pearl millet) have two-thirds of the diversity found in wild relatives 
(Buckler et al. 2001). However, there has probably been a greater loss in 
terms of alleles for agronomic use, as nucleotide diversity estimates are 
relatively insensitive to the loss of rare alleles (Buckler et al. 2001). 
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4.2 Association Mapping 

4.2.1 Association and Population Structure 

Association of a SNP with a trait may be the result from at least three 
possible causes (Risch and Merikangas 1996; Przeworski et al. 2000): (1) the 
locus carrying the SNP is the cause of the phenotype, (2) the locus is in 
linkage disequilibrium (LD) with the cause of the phenotype, meaning 
linked and highly correlated with the phenotype but not representing the 
gene itself, which would be still perfect for plant breeding purposes, and (3) 
the population structure produces the association. To avoid any false 
association due to the population structure, populations used for associa- 
tion mapping should at the same time be characterised by random distrib- 
uted markers, e.g. SSR markers. In maize, Remington et al. (2001) used SSR 
markers representing 47 highly polymorphic loci with a mean of 6.85 alleles 
per locus to define their maize population structure. Garris et al. (2003) 
employed 22 SSR markers to characterise the population structure of 114 
rice accessions. Association approaches can be used to rapidly evaluate 
candidate genes with high resolution but it is essential to determine and 
understand the underlying population structure and to correct for it 
(Pritchard et al. 2000; Nordborg and Innan 2002). 



4.2.2 Candidate Gene Approach Versus Genome-Wide Mapping 

Linkage Disequilibrium (LD) is the non-random association of alleles. 
Values for LD index D’ range from 0 (equilibrium) to 1 (disequilibrium). 
The effectiveness of association mapping depends on the extent of LD 
around the gene of interest. Islands of linkage equilibrium are large seg- 
ments (up to the order of tens of kb) that are characterized by low recom- 
bination, high LD and a small number of haplotypes that are interspersed 
with regions in which recombination is frequent (Daly et al. 2001; Goldstein 
2001; Rafalski 2002a). Factors that influence the linkage disequilibrium are 
the rate of outcrossing, the degree of selection, recombination rate, chro- 
mosomal location and the sampling, which determines the population 
structure (Garris et al. submitted). Investigating six genes in maize, 
Remington et al. (2001) detected that intragenic LD generally declined 
rapidly with distance (r 2 <0.1 within 1,500 bp) but rates of decline were 
highly variable among genes. This extension of LD is much less than the 
50 kb predicted for humans (Koch et al. 2000) where it is even assumed to 
extend much further (Moffatt et al. 2000; Reich et al. 2001). Only for sugary 1 
(sul) evidence was found that LD might persist in maize at anywhere near 
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these distances (Remington et al. 2001). This study of the local and genome- 
wide LD pattern based on six genes included 102 maize inbred lines repre- 
senting much of the worldwide genetic diversity used in maize breeding. As 
there exists a clear relationship between the extent of LD and the resolution 
of association studies (Rafalski 2002a) these results allow to estimate future 
prospects of association mapping in maize. If LD declines slowly with 
increasing distance from the gene of interest, a low density of markers is 
sufficient to identify the associated markers. However, if LD very rapidly 
declines around the gene of interest a much higher density of markers is 
required to identify an associated marker (Rafalski 2002a). 

The rapid breakdown of LD in maize will be favourable for association mapping of candidate 
genes that are located near mapped Quantitative Trait Loci (QTL) but the decay of LD is 
probably too rapid to allow a genome-wide association testing with SNPs as has been 
proposed for human populations (Reich et al. 2001; Remington et al. 2001). According to 
Tenaillon et al. (2001), intragenic LD even declines within 100-200 bp in maize. Whereas 
Remington et al. (2001) looked at maize inbred lines, Tenaillon et al. (2001) compared US 
landraces and inbred lines. In Arabidopsis , LD is more extensive, decaying within perhaps 
250 kb (Nordborg and Innan 2002). This is consistent with the difference in breeding system 
between these two species: maize undergoes outcrossing, whereas Arabidopsis is highly 
selfing, and selfing is expected to increase LD greatly (Nordborg 2000). In sugarcane, LD 
extends for several cM (Jannoo et al. 1999). 

As A. thaliana shows extensive LD, a genome-wide mapping of SNPs is 
favourable (Rafalski 2002a). Cho et al. (1999) constructed a biallelic genetic 
map in Arabidopsis with a resolution of 3.5 cM and investigated its use in 
mapping of Edsl6> a gene involved in the defence response to the fungal 
pathogen Erysiphe orontil Edsl6 could be placed to a 7-cM interval on 
chromosome 1. The results demonstrate that a whole-genome mapping 
using array-based genotyping of SNPs can accelerate the process of posi- 
tional cloning (Cho et al. 1999). 

Based on sequence information present in Arabidopsis 1,800 PCR products, averaging 
280 bp, were amplified from both the Columbia and Landsberg erecta ecotypes. Variation 
was analysed by investigating the heteroduplex formation of Columbia-Landsberg PCR 
products by DHPLC. Using this method, 487 loci carrying SNPs were identified and se- 
quenced afterwards to confirm the mutations between the two ecotypes. After screening for 
optimal melting temperatures and absence of AT-rich regions in the vicinity of the poly- 
morphic base 412 polymorphisms were used to generate an oligonucleotide array. A total 
of 237 SNP markers were placed on the map in Arabidopsis thaliana (Cho et al. 1999). 

According to Drenkard et al. (2000) map-based cloning is greatly facilitated 
in Arabidopsis by the possibility to use SNP markers as demonstrated in 
mapping the mutation edr5- 1, which causes enhanced disease resistance to 
Pseudomonas syringae pv maculicola ES4326 and to Erysiphe orontii. 
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In maize, a two-step strategy of QTL mapping followed by association 
testing of positional candidate genes shows substantial potential for local- 
ising quantitative trait effects to individual genes or even subgenic regions 
(Thornsberry et al. 2001). In a candidate gene approach, the dwarf 8 gene 
could be significantly associated with flowering time in maize (Thornsberry 
et al. 200 1 ) . The rapid decay of LD in maize provides an opportunity to map 
quantitative trait loci with up to 5,000-fold greater resolution than current 
mapping with F 2 or recombinant inbred lines (Remington et al. 2001). 

Association studies in rice showed that linkage disequilibirium in the 
70-kb xa5 region, conferring resistance to Xanthomonas oryzae pv. oryzae , 
was extensive but still informative in eliminating three of eight candidate 
genes for xa5 (Blair et al. 2003). 

The application of association studies in plant breeding has the potential 
to be extremely efficient for studies of agronomically important traits. 



4.2.3 Haplotype-Specific SNP 

The fluidity of genomic parameters, such as linkage disequilibrium, ques- 
tions the applicability of genome-wide studies, which assume that SNPs, 
randomly distributed over the genome, will be sufficient to detect an 
association with a phenotype. However, screening of SNPs on a genome- 
wide scale is interesting because haplotype analyses of multiple SNPs are 
especially powerful for investigating alleles responsible for genetically com- 
plex diseases (Stephens et al. 1998; Tishkoff et al. 2000). A haplotype is the 
specific combination of nucleotides, one from each polymorphic sites that 
are present on an individual chromosome (J. C. Stephens et al. 2001). 
Computer programmes using Clarks algorithm (Clark 1990) can be used to 
analyse the combination of alleles present at each site of polymorphism in 
each individual that assign a specific pair of haplotypes to each individual, 
as well as a score reflecting the confidence in that assignment. SNPH AP (EM 
algorithm) developed by David Clayton and PHASE (Bayesian statistics) 
developed by M. Stephens et al. (2001) are also available (Zhang et al. 2002). 
As more SNP marker data become available in humans, it is becoming 
preferable to use haplotypes of markers for association analysis of candi- 
date genes rather than individual polymorphisms (Clark et al. 2001). For 
example, J.C. Stephens et al. (2001) investigated SNPs present within 313 
genes in humans and organised these SNPs into 4,304 haplotypes. 

Haplotypes can correlate a specific phenotype with a specific gene in a 
small population sample even when individual SNPs cannot (Drysdale et 
al. 2000). Thus, attempts to draw associations between phenotypes and 
genomic variation are more likely to succeed when the SNPs used in such 
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studies have been confirmed to be in linkage disequilibrium by methods 
such as haplotyping (J.C. Stephens et al. 2001). Current haplotype mapping 
focuses on haplotypes with > 5% population frequency (Zhang et al. 2002). 

Existing genome viewers, such as NCBI Map Viewer and the UCSC Genome Viewer (Kent 
et al. 2002) as well as static genotype or haplotype viewers such as Visual Genotype or Visual 
Haloptype (Nickerson et al. 1998; Rieder et al. 1999) are capable of presenting donor 
genotype or haplotype information but lack more sophisticated features, such as SNP 
functional classification, haplotype phase probability assessment, display of population 
haplotype structure and haplotype blocks (Zhang et al. 2002). The software system Hap- 
Scope represents a tool for analysing the genetic and functional correlation of haplotypes 
in a population (Zhang et al. 2002). The minimum SNP set selection algorithms define the 
minimum SNP set, which is the smallest possible subset of SNPs in a haplotype block that 
represents the diversity of haplotypes within the block: such SNPs have also been referred 
to as ‘haplotype tagging SNPs’ or htSNPs’ (Johnson et al. 2001). 



5 Conclusions 

In plants, SNP technology has not yet been exploited in any comparable 

form to humans or animals. However, the results obtained in plants allow 

the following conclusions: 

1. The high frequency of SNPs in plant genomes makes SNPs a very 
interesting universal marker systems, also for plants. Using gel-based 
methods, which have relatively low start-up costs, SNPs can be applied 
in a moderate throughput. 

2. Applications of SNPs become the most powerful tool in genome-wide 
approaches on high-throughput levels. This has been extremely facili- 
tated by whole genome sequencing of model species like Arabidopsis and 
rice and by the availability of large EST and SNP databases for important 
crop species. 

3. For high-throughput techniques, the initial high equipment costs will be 
a problem for laboratories that do not need to perform large numbers of 
genotyping reactions on routine basis (Jander et al. 2002). 

4. One big challenge in plant breeding is the handling of quantitative traits. 
A two-step strategy of QTL mapping, combining marker-assisted QTL 
localisation with association testing of positional candidate genes, shows 
substantial promise for addressing quantitative trait effects to individual 
genes or even subgenic regions (Remington et al. 2001; Thornsberry et 
al. 2001). 

5. However, the power to detect association between an SNP and a quan- 
titative trait ultimately depends on having a sufficient density of SNP 
markers to ensure that some SNPs will be in LD with the allele that 
contributes to the phenotypic variation (Tenaillon et al. 2001). Associa- 
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tion mapping of SNPs from candidate genes will be more promising in 
plants than whole genome-wide association studies. 

6. In plants, genome-wide mapping approaches as proposed for humans 
are less attractive due to the very high number of SNP markers needed. 
In plant breeding, haplotype-specific SNP markers for candidate genes 
represent a very promising marker tool with regard to marker-assisted 
selection of different traits, e.g. disease resistances. 

7. The rapid development of SNPs technologies for high-throughput ap- 
plications will make SNP the most interesting molecular marker system 
for plant research and plant breeding in the near future. SNP technology 
will help to handle complex traits and to resolve the genes involved in 
these traits. 
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1 Introduction 

Large-scale genome analysis and related technologies provide access to a 
refined understanding of the genome. More and more, the systematic 
molecular analysis of structure, function, and regulation of plant genomes 
opens new perspectives for future developments and their application in 
the plant sciences, i.e. production of new and better varieties. The increas- 
ing amount of information on DNA documented in molecular marker 
collections and in dense gene maps, together with an excellent bioinforma- 
tion system, increasingly allows the development of engineering strategies 
towards superior cultivars. Common principles, e.g. in defence reactions to 
biotic and abiotic stress, offering strategies like the candidate gene ap- 
proach, have been set out (Liibberstedt et al. 2002). These developments 
help in the two phases of plant breeding: the production and use of vari- 
ability called the ‘evolutionary phase’, as well as in selection, called the 
‘evaluation phase’ (Khush 2002). Molecular markers are a powerful tool for: 
(1) higher efficiency in parental selection, allowing a controlled combina- 
tion of better combining and thus heterotic parents and (2) pyramiding of 
single traits to result in more complex characters (Ranjekar et al. 2002). 
Additionally, they are presently the most efficient system for uncovering a 
rare but desired genotype in a large, segregating population. 

The tools used today not only for investigation of the model plant Arabidopsis thaliana but 
increasingly in genotype building programs by breeding companies are: (1) bio informatics, 
i.e. database comparisons of functional domains of protein families; (2) expression analysis, 
i.e. analysis of the transcriptome, including inventory of expressed transcripts, global 
expression analysis using cDNA arrays, straightforward identification of genes of interest 
by differential techniques, and (3) the use of mutant collections (knock-out mutants up to 
complete gene machines). 

Molecular markers are virtually unlimited in their number, detectable at all 
plant developmental stages showing no pleiotropic effects. However, one 
of their greatest advantages is that they can be used to dissect quantitatively 
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inherited traits into single Mendelian factors. Thus, molecular markers are 
of great value in applying genomics to crop improvement (Tanksley 1993; 
Mohan et al. 1997; Wenzel 1998; Snowdon et al. 2002). 



2 Range of Molecular Marker Types 

Marker types are numerous; comparisons and evaluations of their useful- 
ness, reproducibility and costs have been well documented (Powell et al. 
1996; Jones et al. 1997; Virk et al. 2000). A choice for a particular marker 
system needs to be made for each specific purpose. For selection of the most 
suitable system, it is important to consider that a high degree of polymor- 
phism is revealed and the entire genome is covered (Bohn et al. 1999). 
Historically, the development of molecular markers started with restriction 
fragment length polymorphism (RFLPs, Botstein et al. 1980), followed by 
the first PCR-based system: random amplified polymorphic DNA (RAPD, 
Welsh and McClelland 1990; Williams et al. 1990). PCR-based methods 
offer the striking advantage of being rapid and requiring only tiny bits of 
plant material for DNA extraction (McGregor et al. 2000). When entire 
genome sequences of major crops become known, as e.g. the rice genome 
in 2002 (Goff et al. 2002; Yu et al. 2002), single nucleotide polymorphism 
(SNP) markers are likely to become most important in research aimed at 
crop improvement. 



2.1 Restriction Fragment Length Polymorphism 

In complex plant genomes, restriction fragment length polymorphisms 
(RFLPs) are detected upon hybridization of (mainly) single-copy DNA 
probes to restriction enzyme-hydrolyzed, agarose gel electrophoresis-sepa- 
rated and nylon membrane-bound genomic DNA. For the detection of 
polymorphisms, the DNAs of the genotypes to be surveyed are digested 
usually with various restriction enzymes, and marker alleles are identified 
by size differences of the restriction fragments to which the probes hybrid- 
ize. Thus, RFLP markers are specified by a single clone/restriction enzyme 
combination. Major sources of RFLP probes are species-specific genomic 
DNA and cDNA sequences. Heterologous probes can be used to link 
genomes from species of the same family (Paterson et al. 2000), and even 
of different plant families (Fulton et al. 2002). Using RFLPs, detailed genetic 
maps of many crop plants have been constructed, e.g. tomato and potato 
(Tanksley et al. 1992), rice (Kurata et al. 1994), maize (Davis et al. 1999) or 
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barley (Kleinhofs and Graner 2001). The principle and technique of RFLP 
production have often been described (Gupta et al. 2002b). 

RFLP analysis is a time-consuming and labour-intensive task and, there- 
fore, not suitable for the rapid genotyping of large segregating populations 
used in a breeding program. An additional disadvantage is the large quan- 
tity of genomic DNA (10 pg or more) needed for target fragment detection. 
Due to superiority and the cost-effective nature of PCR-based marker 
systems, RFLPs lost their importance in genomics without becoming ap- 
plied in practice. RFLPs represent still, however, a highly reproducible and 
reliable marker system that can be used to obtain anchor points for consen- 
sus maps. Moreover, the development of simple PCR, also termed se- 
quence-tagged site (STS) markers, or SNP (single nucleotide polymor- 
phism) markers from sequenced RFLP probes may provide an opportunity 
to determine useful polymorphisms found in previous gene mapping 
studies. 



2.2 Random Amplified Polymorphic DNA 
and DNA Amplification Fingerprinting 

The use of a single, short oligonucleotide primer of random sequence 
(usually ten-base oligomers of varying GC content, ranging from 40 to 
100%) in a low-stringency PCR results in random amplified polymorphic 
DNA (RAPD) which can be directly visualized in conventional agarose gel 
electrophoresis (Williams et al. 1990). The condition precedent to amplifi- 
cation is the occurrence of short inverted repeats of the same random 
sequence on opposite DNA strands within an amplifiable distance. Longer 
primers of up to 30 nucleotides were proposed by Welsh and McClelland 
(1990), shorter ones of 5-8 nucleotides by Caetano-Anolles et al. (1991; 
Caetano-Anolles 2001). With the reduction of the primer length, the 
number of amplified products increases, making this technique useful for 
fingerprinting approaches and thus being named DNA amplification fin- 
gerprinting (DAF). Polymorphisms between individuals result primarily 
from sequence differences in one or both primer sites or from indels 
(insertions-deletions) exceeding the PCR-amplifiable distance, and are 
recognizable by the presence or absence of a particular RAPD band. 

Though RAPDs have some striking advantages such as low cost, the 
small amount of DNA needed (10-100 ng) and the use of universal primers, 
the DNA profiles often are prone to non-reproducibility between laborato- 
ries due to variations in DNA quality and PCR conditions, and different 
models of thermo-cyclers. In replicability studies by Perez et al. (1998), 
mispriming errors amounted to up to 60%. To overcome this problem, 
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Paran and Michelmore (1993) converted RAPD fragments to simple and 
robust PCR markers termed sequence-characterized amplified regions 
(SCARs). Primer pairs are deduced from cloned RAPD fragments usually 
by extending the original decamer primer sequence by 10-15 bases. 



2.3 Amplified Fragment Length Polymorphism 

Amplified fragment length polymorphism (AFLP, Vos et al. 1995) is based 
on PCR amplification of a set of restriction fragments under high stringency 
conditions from a pool of fragments that is generated through digestion 
with a pair of restriction endonucleases, one of them being a frequent, 
four-base cutter (e.g., Msel, Taql), and the other being a rare, six-base cutter 
(e.g., EcoRly Pstly Hindlll). Another widespread enzyme system substitutes 
six- for eight-base cutting enzymes, such as Sse8387I or its isoschizomer 
Sdal (Law et al. 1998; Hard et al. 1999). To allow for PCR tagging of 
restriction fragments, oligonucleotide adapters are ligated to the ends of 
the DNA fragments. The number of DNA fragments to be amplified can be 
restricted by adding two to three selective nucleotides to the core sequence 
of the primers which is composed of adapter- and restriction-site homolo- 
gous sequences. The option to permute the order of these selective bases 
and to recombine the primers with each other theoretically offers the 
possibility of the gradual collection of all restriction fragments of a particu- 
lar enzyme combination showing a suitable size for PCR fragment analysis 
from a genotype. Thus, this method generates many bands, but not too 
many to create difficulties in scoring: a well-balanced number of amplified 
restriction fragments ranges from 50-150. The AFLP process may be im- 
proved by using a multifluorophore detection technique (Schwarz et al. 
2000). In comparison to conventional 33 P-based AFLP analysis the multi- 
fluorophore technique allows multimixing as well as simultaneous ampli- 
fication of AFLP fragments with different primer combinations in one 
reaction (multiplexing). Today, AFLP is the most efficient technique for 
detecting polymorphisms. The polymorphic bands from AFLP patterns 
may also be converted into simple PCR markers (Shan et al. 1999) suitable 
for marker-assisted selection procedures. 

A cDNA-AFLP technique (Bachem et al. 1996), with the standard AFLP protocol on a cDNA 
template, was used for the construction of genome-wide transcriptome maps (Brugmans et 
al. 2002). The AFLP technique can be modified so that one primer is obtained from a known 
multi-copy sequence to detect sequence-specific amplification polymorphisms (S-SAP). 
This approach has been used successfully to generate genome-wide Bare-1 retrotransposon- 
like markers in barley (Waugh et al. 1997) and Avena (Yu and Wise 2000) as well as in alfalfa 
by making use of consensus sequences from long terminal repeats (LTRs) of Tmsl retro- 
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transposon (Porceddu et al. 2002). Based on the LTR regions of retrotransposable elements 
Tpsl2 and Tpsl9 of pea (Pearce et al. 2000) and Tyl-copia of sweet potato (Berenyi et al. 
2002) S-SAP systems were established for their use in genetic diversity studies. 



2.4 Simple Sequence Repeat 

Simple sequence repeats (SSRs) or microsatellites are tandem repeats of 
short nucleotide sequence motifs (1-6 bp) and ubiquitous in eukaryotic 
genomes (Tautz and Renz 1984). SSR loci are extremely variable in the 
number of repeat units among individuals of a given species and can be 
easily typed via STS-PCR (Weber and May 1989). The opportunity to tag 
highly variable DNA sequence motifs, which are evenly spaced throughout 
the genome as simple PCR markers, represented an ideal resource for 
developing a marker class to be applied in practical breeding. As a conse- 
quence, for most important crops, a huge number of SSR markers are 
publicly available for research, such as potato (Milbourne et al. 1998), wheat 
(Roder et al. 1998; Gupta et al. 2002a; Song et al. 2002), soybean (Cregan et 
al. 1999), barley (Liu et al. 1996; Ramsay et al. 2000), rice (Temnykh et al. 
2000), and maize (Sharopova et al. 2002). Furthermore, these short repeti- 
tive sequence elements also occur within genes as demonstrated by search- 
ing EST (expressed sequence tag) databases for the presence of microsatel- 
lites (wheat: Eujayl et al. 2002; rye: Hackauf and Wehling 2002; barley: Thiel 
et al. 2003). EST-derived SSRs are expected to be slightly less polymorphic 
than SSRs derived from genomic libraries, as there is pressure for sequence 
conservation in coding regions. 



2.5 Single Nucleotide Polymorphism 

The primary focus for the development of the next generation of genetic 
markers is on single nucleotide polymorphism (SNP). An SNP is a single 
nucleotide position in a defined DNA stretch at which there is variation 
between different individuals within a species. Thus, single base inser- 
tion/deletion (indels) variants would not formally be considered to be SNPs 
(Brookes 1999). SNPs are of particular interest for their utilization in crop 
improvement, since they (1) represent the most frequent variations in the 
genome of any organism, thus offering the potential of finding informative 
markers for a distinct genomic region in any genetic background and (2) 
can be simply treated as diallelic markers and, therefore, make them ame- 
nable to automated high-throughput genotyping and data handling. 
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The determination of SNP genotypes can be preferentially performed 
using non-gel-based technology platforms such as denaturing high-per- 
formance liquid chromatography (DHPLC, Oefner and Underhill 1998), 
which is based on the detection of heteroduplex formation between mis- 
matched nucleotides of mixed double-stranded PCR samples by ion-paired 
reversed phase chromatography under partially heat-denaturing condi- 
tions. Kota et al. (2001) previously reported on the DHPLC mapping of 
EST-SNPs in the barley genome. Another biochemical reaction principle 
for SNP genotype calling is mini-sequencing which can be used either as 
single-base primer extension (Syvanen 1999) or in the pyrosequencing 
format (Ahmadian et al. 2000). 



3 Use of Genetic Markers in Plant Breeding 

Breeders use three kinds of markers for a successful selection: morphologi- 
cal, biochemical, and molecular. The clearest and up till now most success- 
ful ones are the morphological markers, since they have been the characters 
on which Mendel elaborated the Mendelian laws. They enabled the estab- 
lishment of genetic maps for numerous important crop plants (e.g. barley: 
Tsuchiya 1986). The biochemical markers are very limited in number, 
however, a few are tightly coupled to important genes and thus very useful 
in selection (e.g. the endopeptidase isozyme allele Ep-Dlb for detection of 
the Pchl gene conferring resistance to Pseudocercosporella herpotrichoides 
in wheat, or the correlation of bread-making quality and the presence of 
particular high-molecular weight glutenin subunits among the seed storage 
protein complex of wheat). 

Although a DNA marker for Wx-Blb , the null allele important for starch quality, is available 
(McLauchlan et al. 2001), an ELISA assay that has now been developed for Wx-Bl (Gale et 
al. 2001 ) is likely to be much more cost-effective than the DNA marker currently being used. 

For molecular markers a severe restraint exists in application. The cost for 
this process in plants is as high as in human medicine. Since even the best 
new cultivar will have difficulties to make so much money that all molecular 
work can be paid by this gain, we are faced with the following dilemma. One 
of the biggest challenges in the future will be to understand how genetics 
and biology work together and it is not only scientific interest by which this 
development is driven, it is also the need for more food and feed. The annual 
increase in plant productivity has become less than in previous years (James 
2002). The three pillars: genetics, chemistry, and mechanization, on which 
the increased yield in agricultural production is based, are no longer con- 
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tributing equally; progress in the latter two is mainly improving environ- 
mental conditions rather than productivity. Thus, we urgently need further 
understanding of gene function in order to manipulate it for the benefit of 
mankind. On the other hand, the projected costs are unlikely to be covered 
by seed consumers nor by environmental revenues. The dispute whether 
MAS selection is a fund-adsorbing system (Melchinger and Utz 2002) or 
whether it is the most advanced screening remains unresolved. The answer 
will be neither a clear yes or no. It will depend very much on the type of trait 
in question (qualitative or quantitative) and the applicability of the respec- 
tive marker. 



4 Present Status of Validated Molecular Markers 
for Molecular Breeding of Important Crops 

A huge number of agronomically important genes have been tagged with 
the aid of molecular markers. However, most of them, although not the case 
for ‘perfect’ (gene sequence-derived) markers, do not meet conditions for 
marker-assisted selection (MAS), which are as follows: 

1 . Markers should co-segregate or map as closely as possible to the target 
gene (2 cM or less) to have low recombination frequency between the 
target gene and the marker. A better estimate of map distance between 
the target gene and the marker will be obtained by analysing further 
mapping populations which have genotypes in common with those used 
in the initial mapping population. Accuracy of MAS will be improved if, 
rather than a single marker, two markers flanking the target gene are 
used (Peng et al. 2000). 

2. For unlimited use in MAS, the marker should display polymorphism 
between genotypes that have and do not have the target gene. 

3. Cost-effective, simple PCR markers are required to ensure genotyping 
power needed for the fast screening of large populations. 

Markers that have been elaborated and validated for the monitoring of 
agronomically important traits and which are used in current breeding 
programs are listed in Table 1. Most of them are detectable using simple 
PCR. However, some need to be tested for polymorphism between parental 
lines using breeding programs. In cases where RFLP markers have been 
used, such as for monitoring the cereal cyst nematode resistance gene Crel 
in wheat (Ogbonnaya et al. 2001b), the gain fitted the effort due to the 
importance of the disease, high costs and unreliability of the bioassays. 

Resistance gene analogs (RGAs) become more and more interesting as 
molecular markers (Table 1). RGAs are primarily located close to resistance 
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Table 1. Validated markers for the monitoring of agronomically important traits in marker- 
assisted selection programs 



Crop 


Trait 


Gene(s) 


Marker type(s) 


References 


Wheat 


Reduced height 


Rht-Blb 


Gene-specific 


Ellis et al. (2003) 






Rht-Dlb 


SNP markers 
(competitive AS a -PCR) 








Rht8 


SSR 


Korzun et al. (1998) 




Starch quality 


Wx-Blb 


Gene-specific AS 


McLauchlan et al. 
(2001) 




Powdery mildew 


Pmlc 


AFLP 


Hard et al. (1999) 




resistance 


Pml7 


RFLP-derived STS 


Mohler et al. (2001) 






Pm24 


SSR 


Huang et al. (2000) 




Cereal cyst 


Crel 


RGA-RFLP 


Ogbonnaya et al. 




nematode 

resistance 


Cre3 


RGA-based STS 


(2001b) 




Leaf rust 


Lr21 


RFLP-derived STS 


Huang and Gill (200 1) 




resistance 


Lr47 


RFLP-derived CAPS b 


Helguera et al. (2000) 




VPM rust 


Sr38/Lr38/ 


RGA-based STS 


Seah et al. (2001) 




resistance 


Yrl7 






Barley 


Barley yellow 


rym4 


SSR 


Graner et al. (1999) 




mosaic virus 


rym5 








resistance 


rym9 


RAPD-derived STS 


Werner et al. (2000) 


Rice 


Bacterial blight 


xa-5 


Trait-flanking 


Huang et al. (1997) 








RFLP-derived 
CAPS and STS 


Sanchez et al. (2000) 






xa-13 


RFLP-derived CAPS 


Huang et al. (1997) 






Xa-21 


RAPD-derived STS 


Chunwongse et al. 
(1993) 




Blast resistance 


Pil 


Trait- flanking RFLPs 


Hittalmani et al. 






Piz-5 


RFLP-derived CAPS 


(2000) 






Pita 


Trait- flanking RFLPs 




Potato 


Potato virus Y 


Ryadg 


RGA-based CAPS 


Sorri et al. (1999) 




resistance 


RGA-based STS 


Kasai et al. (2000) 


Soybean 


Soybean cyst 
nematode 


Rhg4 


TaqMan assay 


Meksem et al. (2001) 



a AS, allele-specific. 

p CAPS, cleaved amplified polymorphic sequence. 
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genes and are often members of local multigene families (Liibberstedt et al. 
2002). Due to their clustered organization, the identification of the active 
gene copy remains a major challenge, but a high probability is given to find 
an informative SNP associated with the resistance phenotype. Homologous 
sequences from the common wheat Cre3 locus isolated from Aegilops 
ventricosa , and shown to be of chromosome 5Nv origin, were successfully 
used as perfect markers for the presence of the Cre6 gene (Ogbonnaya et al. 
2001a). Costs of marker development may also be reduced since there is a 
chance of RGA markers to be used across the genomes of species from the 
same plant family (Mohler et al. 2002). Such broader markers are urgently 
needed. 

In this context, the availability of SSR markers from the expressed 
portion of the genome might facilitate their transferability across genera, 
compared to the low efficiency of SSR markers retrieved from gene-poor 
areas in cross-generic amplification (Peakall et al. 1998). This approach 
would also benefit plant species with minimal resource and research expen- 
diture. 



5 Marker- Assisted Selection (MAS) 

MAS offers the opportunity to select desirable lines based on genotype 
rather than phenotype, especially in the case of combining different resis- 
tance genes for a given pathosystem in a single genotype (gene pyramid- 
ing), since it is difficult to select plants with multiple resistance genes based 
on phenotype alone as the action of one gene may mask the action of 
another. Successful examples are the combination of genes 
xa4+xa5+xal3+Xa21 against bacterial blight disease in rice (Huang et al. 
1997; Sanchez et al. 2000; Singh et al. 2001), Pil+Piz-5+Pita conferring blast 
resistance in rice (Hittalmani et al. 2000) and Pm2+Pm4a , Pm2+Pm21 , 
Pm4a+Pm21 against powdery mildew disease in common wheat (Liu et al. 

2000) . Novel approaches deal with the implementation of transgenes in 
breeding programs such as the pyramiding of Bt genes cry 1 Ac and cry 1C 
conferring resistance to diamondback moths in broccoli (Cao et al. 2002). 

Molecular markers also contribute to improvement in efficiency of 
backcrossing, allowing at each backcross cycle the identification of carriers 
of the target trait (foreground selection) with the closest fit to the recurrent 
parent genotype (background selection). Two studies (Chen et al. 2000, 

2001) reported on the improvement of restorer lines widely used in Chinese 
hybrid rice production, to bacterial blight resistance, caused by Xantho- 
monas oryzae pv. oryzae (Xoo), through introgression of Xa21 , a broad- 
spectrum bacterial blight resistance gene. The PCR-based foreground se- 
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lection system consisted of a marker that was part of gene Xa21, while RFLP 
and AFLP markers, respectively, were applied for background selection. 



6 Strategies to Handle Complex Characters 

It is still beyond practical use to apply MAS routinely for QTLs. Only a few 
reports are available on the successful use of QTL-MAS in backcross-as- 
sisted breeding strategies (Shen et al. 2001; Willcox et al. 2002). Advanced 
backcross QTL (AB-QTL, Tanksley and Nelson 1996) analysis was proposed 
as a general strategy for the simultaneous detection of QTL qualified for 
breeding purposes and variety development. The use of QTL analysis will 
be delayed until an advanced backcross generation offers advantages for 
QTL characterization such that the probability is reduced for the detection 
of QTL displaying epistatic interactions among donor alleles due to the 
overall lower frequency of donor alleles. In fact, there will be a higher 
probability of detecting additive QTL which will still function in a nearly 
isogenic background. 

Efforts in indexing and sequencing of EST collections allow high- 
throughput scanning of SNPs among EST alleles from different genotypes. 
Along with SNP technology, which principally makes comprehensive 
haplotype analyses possible, association mapping (Risch 2000) has the 
potential to play a central role in the task of uncovering which candidate 
gene alleles cause complex traits in crop plants. 



7 Outlook from a Breeder’s Perspective 

The number of molecular marker techniques and the applicability of domi- 
nant and co-dominant DNA markers made molecular plant breeding in 
principle a reality. The speed by which a new marker technique was used 
in laboratories was, however, such that a new and more efficient system 
became available before the latest marker system could be fully optimized 
or applied (Gupta et al. 2002b). Thus, breeders became reluctant to invest 
in a new technique which was already outdated when ready for use. Another 
drawback has been the high costs for a single test, resulting in limited 
genotype numbers that could be screened at a reasonable price. Conse- 
quently, the most advanced SNP allele calling procedures are not gel based 
and, therefore, readily automated for high-throughput genotyping, allow- 
ing the cost-effective screening of large populations. They are probably the 
most attractive tool for marker-assisted selection in future plant breeding 
programs. 
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Presently, only under specific applications, has MAS become a practical 
tool. MAS is applied in the selection for traits controlled by major genes 
responsible for important characters which are difficult or expensive to 
phenotype, e.g. resistance to soil-borne viruses (Friedt et al. 2001). Also, 
pyramiding of monogenically controlled resistances by MAS that result in 
an oligogenic trait causing more comprehensive resistance has been 
adopted by the breeding industry (Wenzel et al. 2001). Before genes can be 
combined in such a programmed way, it is necessary to determine whether 
or not they are allelic. Of course, pyramiding works for additive genes only 
and a serious problem is the specificity of the majority of available markers. 
They work only for those parents for which they have been precisely 
characterized. Breeders still need more universal markers. 

The situation for traits controlled polygenically is even less satisfactory. 
Although numerous papers have been published describing and localizing 
QTL, understanding their interaction is very difficult. The impact of factors 
like temperature, day length, humidity on gene expression and metabolism 
is multifaceted and will require much more understanding of general gene 
functions. Moreover, a clear strategy as to how to make use of QTL via gene 
transformation is not yet feasible. The ideas go via master or major genes, 
via regulatory elements to a complete slicing and subsequent stepwise 
combination (Khush 2002). Because of multigenic responses, the question 
of additivity of gene products needs to be explored, normally not an easy 
task. 

The over-expression of phytochrome B in potato resulted, e.g. under greenhouse conditions, 
in an increased yield (Thiele et al. 1999). Running the experiment under field conditions 
could not verify the greenhouse results (Braun et al. 2002; Schittenhelm 2002). To optimize 
allele complexes with high stability and a minimum of side effects requires an understanding 
of factors that are limiting in their expression as close to the end product in signalling and 
transcription pathway as possible (Winicov 2002). Successful identification of genes being 
able to accomplish this would provide reliable markers for plant breeding. A more global 
understanding of the action of QTL might allow one to treat them in slices as Mendelian 
characters in a monogenic manner. 

Marker-based approaches are reaching a limit, opening up the newer field 
of genomics, dealing with the genes/enzymes directly rather than with 
indirect markers. Without using map-based cloning, chances are improv- 
ing to find precise gene functions, e.g. via differential display techniques 
(Thummler and Wenzel 2000). Genomics refers to application of global (in 
contrast to gene by gene) experimental approaches using DNA sequence 
data to assess gene functions by parallel methods such as expression pro- 
filing and screening for insertion mutants (Bouchez and Hofte 1998). For- 
tunately, information gathered in one species might be transferred to 
another species by conserved map location or via orthologous genes in 
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related species (e.g. in the Triticeae: Mohler et al. 2002) and detraction of 
sequence databases (in silico genetics). 

Most mapping and marker isolation has been done at the diploid level. In tetraploids like 
potato the dosage effect by doubling the ploidy level might have a striking influence (Luo et 
al. 2001). Markers are also useful for organelle DNA where mitochondrial DNA is more 
variable than plastid DNA. It should become possible to select with the help of markers for 
mitochondria which fit our demands better than mitochondria developed through natural 
evolution (Frei et al. 2003). The necessary variability has to be produced via somatic fusion. 
Thus, this approach works quite efficiently in vegetatively propagated tetraploid crops like 
potato. An optimal combination of plastome, chondriome and genome, using somatic 
hybrids (Lossl et al. 2000), could increase starch production. 

Fortunately, molecular markers are an accepted tool of molecular genetics 
even by the public, which is generally critical of gene technology. The 
combined application of molecular markers and genome-wide, global ap- 
proaches will contribute to better genotype descriptions and securer selec- 
tions. As soon as validated universal markers reach the breeding stations, 
new concepts for population sizes and selection intensities will support 
condensed breeding programs. Perhaps the size reduction in a conven- 
tional program can counterbalance the higher cost for laboratory assis- 
tance. There is also a strong tendency to believe that classical combination 
breeding together with MAS is more universal in achieving breeding goals 
than gene transfer strategies. 
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Extranuclear Inheritance: 
Chloroplast Proteomics 

Michael Hippier and Ralph Bock 



1 Introduction: 

The Choroplast as a Model System in Proteomics 

Biology has arrived in the “omics” age. Currently, there is no better justifi- 
cation for the importance of one’s own research field than adding the suffix 
“omics” to it (Fig. 1). This somewhat unfortunate tendency was initiated 
with the systematic and high-throughput sequencing of entire genomes for 
which the term “genomics” was coined. Soon, researchers using systematic 
approaches to elucidate gene functions felt it important to distinguish 
between “structural genomics” (i.e. genome sequencing) and “functional 
genomics” (i.e. elucidation of gene functions; see e.g. Bock and Hippier 
2002). What are the criteria for “omics”? Certainly, any “omics” should (1) 
take a systematic approach and (2) use high-throughput techniques with 
the ultimate goal of achieving completeness (complete sequence, complete 
set of RNAs, proteins, metabolites, etc.). From this viewpoint, clearly, some 
fields are relatively far (e.g. “metabolomics”) or even very far (e.g. “struc- 
turomics”) from meeting these criteria and, here, enthusiastic addition of 
the ending “omics” appears premature (Fig. 1). 

In this review, we aim to cover a field for which the “omics” is very well 
justified: proteomics. Proteomics can be defined as the systematic analysis 
of proteins produced in a single organism, tissue, cell or subcellular com- 
partment at a certain time and covers three main aspects: protein identity, 
quantity and function(s). The central concept of proteomics is the simulta- 
neous study of all proteins in a given protein population, rather than 
analysis of one protein at a time, as in traditional biochemistry. 

Recent technological progress has made it feasible to separate and ana- 
lyze the full protein set of subcellular compartments within a reasonable 
time frame. Cell organelles provide excellent model systems for proteomics, 
because (1) they contain a relatively small, manageable set of proteins, (2) 
they can be easily isolated in large amounts and at high purity and (3) they 
can be further fractionated into suborganellar compartments (“subpro- 
teomes”), such as outer and inner membranes, the intermembrane space, 
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Fig. 1. The world of “omics” in modern biology 
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chloroplast thylakoids, etc. In this chapter, we shall summarize bioinfor- 
matic attempts to estimate the size of the chloroplast proteome, highlight 
the state of the art in proteome research and describe the results of recent 
studies on chloroplast subproteomes. 



2 The Complexity of the Chloroplast Proteome 
and Its Evolutionary Origin 

Plastids are derived from formerly free-living prokaryotes: More than a 
billion years ago, a cyanobacterium was engulfed by a eukaryotic host cell 
(already possessing mitochondria) and, different from endocytosis, was not 
digested but entered a symbiotic relationship. Subsequently, the cyanobac- 
terial endosymbiont was gradually integrated into the metabolism of the 
host cell. Together, host and endosymbiont established a sophisticated 
division of labor by inventing efficient regulatory networks to coordinate 
the gene expression in the endosymbiont’s genome with that in the host’s 
nuclear genome. The evolutionary shaping of the endosymbiosis involved 
the (1) loss of dispensable genetic information (e.g., genes for cyanobacte- 
rial cell wall biosynthesis), (2) elimination of redundant genetic informa- 
tion (e.g., biosynthetic pathways for carbohydrate, amino acid and lipid 
biosyntheses), and (3) the massive translocation of genetic information 
from the endosymbiont’ s DNA to the host cell’s nuclear genome. Conse- 
quently, present-day organellar genomes are greatly reduced and contain 
only a small proportion of the genes that their free-living ancestors had 
possessed: Higher plant plastid genomes harbor only about 130 genes in 
approximately 150 kbp whereas the cyanobacterium Synechocystis has 
more than 3100 genes (and open reading frames) in a genome of 3.57 Mbp. 
It has long been known that the vast majority of chloroplast proteins is not 
encoded by the plastid genome but is encoded by nuclear-localized genes, 
synthesized on cytosolic ribosomes and post-translationally imported into 
plastids. Protein targeting to plastids is directed by an N-terminal transit 
peptide which usually is cleaved off during the import process. 

The availability of complete genome sequences for the cyanobacterium 
Synechocystis (Kaneko et al. 1996; Kaneko and Tabata 1997; Kotani and 
Tabata 1998) and the model plant Arabidopsis thaliana (The Arabidopsis 
Genome Initiative 2000) has facilitated the systematic reconstruction of the 
evolutionary origin of plant nuclear genes as well as the calculation of 
proteome sizes for different compartments of the plant cell. Analyzing a 
sample of 3961 Arabidopsis nuclear protein-coding genes (out of a total of 
-25,000 genes in the Arabidopsis genome) using a number of bioinformat- 
ics tools (homology searches, protein targeting prediction, construction of 
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phylogenetic trees; Table 1), Rujan and Martin calculated that between 400 
(1.6%) and 2200 (9.2%) of the Arabidopsis nuclear genes stem from the 
cyanobacterial endosymbiont (Rujan and Martin 2001) and have ended up 
in the nuclear genome through gene transfer events out of the plastid. These 
numbers are in good agreement with another study which estimated that 
between 1150 and 1550 genes of cyanobacterial origin reside in the Arabi- 
dopsis nuclear genome (Abdallah et al. 2000). [A very recent new estimate 
incorporating data from 19 fully sequenced prokaryotic genomes suggests 
an even higher number of Arabidopsis genes of cyanobacterial origin (Mar- 
tin et al. 2002).] The gene products of how many of these cyanobacterially 
derived nuclear genes are reimported into plastids and contribute to the 
size of the plastid proteome? Searches for putative N-terminal transit pep- 
tide sequences directing chloroplast protein import (see Table 1) suggest 
that between 650 and 900 Arabidopsis nuclear genes of cyanobacterial 
origin encode plastid-targeted proteins (Abdallah et al. 2000). The rest of 
the chloroplast proteome (estimated to be in the range of 1200 to 1600 
proteins) comprises three categories: (1) genes coming from the genome of 
the eukaryotic host cell which, during evolution, have acquired sequences 
for chloroplast transit peptides, (2) new plant-specific genes which evolved 
after endosymbiosis and (3) genes of currently unknown evolutionary 
origin (some of which may be additional cyanobacterially derived genes). 
Thus, in total, the plastid proteome consists of 1900 to 2500 proteins of 
which: 

- approximately 80 are encoded by the plastid genome, 

- 650 to 900 are encoded by nuclear genes of cyanobacterial origin, 

- 1200 to 1600 are encoded by nuclear genes of non-cyanobacterial (or 

uncertain) evolutionary origin. 

All these calculations exclude additional protein diversity generated, for 
example, by processes like alternative splicing, differential protein process- 
ing or post-translational modifications. As it is currently unknown how 
widespread such protein isoforms made from one and the same gene are in 
chloroplasts, the total size of the plastid proteome may currently be under- 
estimated. 

Improved proteomics technologies (van Wijk 2000) nowadays make it 
possible to manage a set of 1900 to 2500 proteins as calculated to be present 
in higher plant chloroplasts, all the more since this set of proteins is further 
subcompartmentalized (into intermembrane space, stroma, thylakoid lu- 
men and the respective membrane fractions) facilitating its convenient 
experimental splitting into subproteomes. 




Table 1. Selected WWW tools for chloroplast genomics, proteomics and bioinformatics 
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ExPASy Various proteomic tools for the analysis of protein sequences http://www.expasy.ch/ 

and structures as well as 2-D PAGE 

MOTIF Searches for protein (and nucleic acid) sequence motifs http://www.motif.genome.ad.jp/ 
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3 Methods in Chloroplast Proteomics 

The analysis of complex protein profiles requires methods that allow for 
high resolution protein separation combined with very sensitive methods 
for protein identification (Fig. 2). The standard technique for quantitative 
proteome analysis is protein separation by high-resolution (isoelectric 
focusing/SDS-PAGE) two-dimensional gel electrophoresis (2-DE). This 
method separates proteins according to their isoelectric points in the first 
dimension and according to their molecular masses in the second. Al- 




Fig. 2. High-resolution protein separation and mass spectrometric identification strategies. 
This simplified schematic diagram outlines strategies that depend on 2-DE ( solid ellipses) 
or alternatives that do not require protein separation by 2-DE ( dotted ellipses). In 2-DE- 
based approaches, a mixture of proteins is separated by 2-DE and visualized by staining. 
Individual protein spots are then excised and subjected to proteolysis. The resulting peptide 
mixture can subsequently be analyzed either by time-of- flight mass spectrometry (MALDI- 
TOF) to determine the masses of individual peptides (by mass mapping) or tandem mass 
spectrometry to determine (partial) amino acid sequences of the respective peptides. Protein 
separation strategies alternative to 2-DE utilize peptide mixtures resulting from proteolysis 
of a mixture of intact proteins. Peptide mixture can be separated by single (LC-MS/MS), 
multidimensional (LC/LC-MS/MS) HPLC or capillary electrophoresis (CE) and then ana- 
lyzed on-line by tandem mass spectrometry. It should be noted that peptide mixtures 
obtained from proteolytic digestion of 2-DE separated spots can also be subjected to liquid 
chromatography prior to mass spectrometric analysis 
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though 2-DE was introduced already almost 30 years ago (O’Farrell 1975), 
new developments, like the introduction of immobilized pH gradients 
(IPG) or in-gel sample application, have greatly increased the reproduci- 
bility of 2-DE. This is important not only for comparative analyses of 
analytical 2-D protein maps, but also for their methodological stand- 
ardization. The resolving power of this technique can be rationalized by the 
fact that over 10,000 proteins can be separated on a single gel (Klose 1999). 
To obtain sequence information from individual protein spots, the spots 
are cut out of the gel and digested in-gel with an appropriate protease, 
mostly trypsin. The resulting peptides are eluted and analyzed by mass 
spectrometry (MS). Based on sensitivity, mass spectrometry is the method 
of choice for identification of proteins. To perform mass spectrometry, 
several possible instrumentations are available. Matrix-assisted laser 
desorption ionization (MALDI) time-of-flight (TOF) instruments are com- 
monly used for large-scale protein identification by the peptide mass map- 
ping technique and are also used in chloroplast proteomics (Peltier et al. 
2000, 2001, 2002; Yamaguchi and Subramanian 2000; Yamaguchi et al. 2000; 
Schubert et al. 2002). Peptide masses are determined for specific spots by 
MALDI-MS and these mass maps are then compared with predicted mass 
maps in the database to identify the respective protein. This peptide mass 
mapping strategy is a powerful tool for protein identification. However, one 
of its limitations is that the database must contain sufficient protein se- 
quences from the organism under investigation, which is often a problem 
with plants since so far, only the Arabidopsis genome is completely se- 
quenced. Another limitation of the MALDI-TOF mass mapping technique 
is that it is very sensitive to contamination with other proteins or protein 
isoforms arising, for example, from splice variants, RNA editing or post- 
translational modifications, which increase the ambiguity in protein iden- 
tification. Besides the MALDI ionization technique, electrospray ionization 
(ESI) is frequently used and coupled with triple quadropole, ion-trap and 
hybrid quadropole-time-of- flight (Q-TOF) tandem MS. ESI-MS instrumen- 
tations are regularly utilized in combination with liquid chromatography 
(LC) or static nanospray techniques. Tandem MS can be employed to 
generate peptide fragment ion spectra by collision induced fragmentation 
(CID). CID predominantly causes fragmentation of peptide bonds. There- 
fore, it is possible to search for mass differences between peaks in the mass 
spectrum that differ by the mass of a single amino acid (which was deleted 
at a specific position within the peptide), thereby facilitating the deduction 
of short amino acid sequences. Software tools are available for the interpre- 
tation of amino acid sequence information from CID fragmentation spec- 
tra. These tools compare the experimental data with predicted spectra 
generated from database sequences. Comparisons between an experimen- 
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tal spectrum and spectra from the database can be done with Mascot 
(Perkins et al. 1999), an Internet-accessible program, or Sequest (Eng et al. 
1994), a program that is commercially available. The extraction of short 
sequence “tags” from MS data requires partial amino acid sequence inter- 
pretation and can be carried out by PeptideSearch (Mann and Wilm 1994), 
another Internet-accessible program. In chloroplast proteomics, both ESI- 
MS and tandem MS have been used in a number of studies (Whitelegge et 
al. 1998; Peltier et al. 2000, 2001, 2002; Yamaguchi and Subramanian 2000; 
Yamaguchi et al. 2000; Hippier et al. 2001; Ferro et al. 2002; Gomez et al. 
2002; Roller et al. 2002). 

Recently, new methods in proteomic research have been developed 
which promise to be of enormous usefulness also for chloroplast pro- 
teomics. Here, we will briefly describe some of these new techniques 
(Fig. 2). Several of the new approaches take advantage of the coupling 
between liquid chromatography and tandem MS (LC/MS/MS) (Peng and 
Gygi 2001). Multi-dimensional liquid chromatography coupled to MS/MS 
allowed, for example, the identification of more than 70 proteins present in 
the yeast ribosome in a single analysis. This study was performed by 
analyzing tryptic peptides derived from digestion of the whole complex 
(Link et al. 1999). Multidimensional liquid chromatography coupled to 
tandem MS can also be used for large-scale proteome analysis. Using this 
combination of techniques, 1484 proteins could be identified from Sac - 
charomyces cerevisiae including lowly abundant proteins like transcription 
factors and kinases which probably would have escaped detection by 2-DE 
(Washburn et al. 2001). The power of this experimental approach was also 
demonstrated by the systematic proteomic analysis of rice ( Oryza sativa) 
leaf, root and seed tissues (Roller et al. 2002). This comparative study 
resulted in detection and identification of 2528 unique proteins. Among 
these proteins, 622 were leaf-, 862 root- and 512 seed-specific proteins, 
demonstrating that the overall protein expression profile is highly tissue- 
specific. 

For quantifying differential protein expression, the isotope-coded affin- 
ity (ICAT) strategy is another approach that can be applied to LC/MS/MS 
(Gygi et al. 1999). Also, a robust gene tagging method is now available that 
facilitates the systematic analysis of components of multiprotein complexes 
(Gavin et al. 2002). Here, a gene encoding a subunit of the complex is tagged 
with a cassette coding for protein A and a calmodulin-binding protein 
(separated by a TEV protease cleavage site). Upon expression of the tagged 
gene in vivo, the respective gene product incorporates into its native 
complex allowing subsequent tandem-affinity purification (TAP) of the 
multiprotein complex by a two-step affinity chromatography procedure: 
The first step involves binding of the tagged complex via protein A to an 
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IgG column, followed by cleavage with the TEV protease releasing the 
purified complex. The second purification step employs calcium-cal- 
modulin affinity chromatography to eliminate residual contaminants. Fi- 
nally, individual components of the complex are analyzed by MS/MS (Puig 
et al. 2001). 



4 Proteomics of Chloroplast Subcompartments 

4.1 Proteomics of Soluble Proteins 

As mentioned above, possibly more than 2000 nuclear-encoded gene prod- 
ucts, in addition to the about 80 chloroplast-encoded proteins, are present 
in the chloroplast. To establish number, identity and abundance of these 
proteins, proteomics has become the most important tool. Especially the 
large-scale analysis of soluble proteins is straightforward since 2-DE based 
approaches are directly applicable to high-resolution separation of these 
proteins combined with mass spectrometric identification techniques. An 
impressive proteomics study of a soluble multiprotein complex succeeded 
in the identification of all ribosomal proteins (RPs) in spinach chloroplasts 
(Yamaguchi and Subramanian 2000; Yamaguchi et al. 2000). The spinach 
plastid ribosome is of the prokaryotic 70S type and comprises 59 proteins: 
33 in the 50S and 25 in the 30S subunit. Plastid ribosomes are typical 
organellar multiprotein complexes in that they consist of both nuclear and 
organelle genome-encoded proteins (25 of the plastid 50S and 13 of the 30S 
RPs are nuclear encoded, whereas 8 of the 50S and 12 of the 30S RPs are 
plastid encoded). The plastid RPs also serve as good examples for the 
expression of a single gene resulting in a protein population that displays 
micro-heterogeneity. Such heterogeneity is observed with several plastid 
RPs and is caused by post-translational modifications, such as, oc-AT-acety- 
lation of S9, differential N- terminal processing leading to five mature forms 
of S6 and two mature forms of S10 (30S subunit) and N- terminal/internal 
modifications in L2, LI 1 and L 16 (50S subunit). The functional implications 
of these modifications are still unclear but it is tempting to speculate that 
they may serve regulatory functions in the plastid translational apparatus. 

Another chloroplast multi-protein complex that was dissected by pro- 
teomics is the ClpP protease complex from Arabidopsis (Peltier et al. 2001). 
The 350-kDa protease complex was isolated by blue-native gel electropho- 
resis and SDS-PAGE and ten different Clp isoforms were identified by mass 
spectrometry. 

In a first systematic analysis of the proteome of a chloroplast subcom- 
partment, 2-DE based approaches were taken to analyze soluble lumenal 
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and peripheral thylakoid proteins from pea (Peltier et al. 2000). From this 
study, it was estimated that at least 200 to 230 different lumenal and 
peripheral proteins exist. The proteome of the thylakoid lumen was inves- 
tigated in greater detail and a 2-D map of Arabidopsis lumenal proteins has 
been established (Peltier et al. 2002; Schubert et al. 2002). In both studies, 
30 to 36 lumenal proteins were identified and it was estimated that the 
Arabidopsis thylakoid lumen may contain a total of about 80 proteins. The 
presence of multiple isomerases, peroxiredoxins, m-type thioredoxins and 
a putative ascorbate peroxidase indicates that the lumenal proteome con- 
tains protein folding activities and is involved in the antioxidative defense 
network. It is noteworthy that among these proteins, a family of novel PsbP 
domain proteins was found. The extrinsic proteins PsbO, PsbP and PsbQ 
participate in the regulation of oxygen evolution and are present in the 
photosystem II (PSII) water- splitting complex in a stoichiometric ratio of 
1:1:1. These novel PsbP domain proteins are possibly interchangeable at the 
lumenal side of PSII and could thereby provide a functional tool to modu- 
late oxygen evolution activity of the PSII protein complex in response to 
environmental cues and/or developmental programs (Peltier et al. 2002; 
Schubert et al. 2002). These studies nicely demonstrate that proteomics 
cannot only identify protein sets, but also provides new functional impli- 
cations and hypotheses. To convert these hypotheses into real under- 
standing of biological function, new experiments can now be designed. 



4.2 Proteomics of Membrane Proteins 

The separation of hydrophobic intrinsic membrane proteins by 2-DE has 
long been a difficult task (Santoni et al. 2000). However, with the recent 
development of procedures for the analysis of transmembrane thylakoid 
proteins by 2-DE, proteomics of membrane proteins has become feasible. 
The successful separation of transmembrane proteins by 2-DE was 
achieved by combining extraction of hydrophobic proteins with organic 
solvents and subsequent solubilization of the precipitated proteins in a 
mixture of detergents together with urea and thiourea. 2-DE separation was 
combined with identification of protein spots by MS/MS and immunobio- 
chemical techniques (Hippier et al. 2001). This procedure allowed the 
separation and identification of hydrophobic transmembrane proteins, 
such as the light-harvesting proteins, and was used to create 2-DE protein 
maps of thylakoid membrane proteins from wild type and mutant strains 
of Chlamydomonas reinhardtii (Hippier et al. 2001). The technique was also 
used to monitor changes in the protein composition of the thylakoid 
membrane during adaptation to iron deficiency (Moseley et al. 2002; see 
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Fig. 3. Example of a comparative proteomics study in Chlamydomonas chloroplasts. The top 
panel (day 0 and day 5) represents silver-stained 2-DE gels of thylakoid membranes from 
wild-type cells before and 5 days after growth in iron-deficient (0 pM Fe) medium, respec- 
tively. Circles indicate spots that disappear and squares indicate spots that newly appear (see 
also arrows) or increase in intensity under iron deficiency. The lower panel shows the 
evolution of a new putative LHCI protein during the adaptation to iron deficiency. Note that 
spots 3, 6 and 9 in the upper panel represent LHCI polypeptides. The lower panel shows the 
enlarged box B (upper panel) from silver-stained 2-DE gels of thylakoid membranes from 
wild-type cells before and after 1 to 5 days of growth in iron-deficient medium. (Adapted 
from Moseley et al. 2002) 



Fig. 3). This adaptation process is rather complex and highly dynamic since 
several protein spots disappear and a number of new protein spots appear 
as compared to 2-DE maps from iron repletion conditions (Fig. 3). Inter- 
estingly, some of the photosystem I light-harvesting complex (LHCI) 
subunits are degraded during adaptation to iron deficiency whereas new 
LHCI protein spots are induced under these conditions (Fig. 3). It is cur- 
rently unclear whether these iron deficiency-induced LHCI spots represent 
newly synthesized isoforms or arise from post-translational modification 
or processing of already existing proteins. 

In another study, a detailed 2-D protein map of Lhcb proteins from C. 
reinhardtii was established (A. Fink, E. Stauber, U. Johanningmeier, and M. 
Hippier, in prep.). Lhcbl (CabII-1), LhcII-1.3, CabII-2 and LhcII-4 as well 
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as CP29 and CP26 protein spots were identified. Interestingly, two Lhcbl 
proteins with different molecular masses and different isoelectric points 
were found. Previous mass spectrometric analysis of 2-DE separated Lhcbl 
had already indicated that this LHCII protein may contain two alternative 
N-terminal transit peptide cleavage sites (Hippier et al. 2001). 

An HA-tag was introduced into the lhcbl gene in such a way that it is localized behind the 
second suggested N-terminal processing site in the expressed gene product (Imbault et al. 
1988). Anti-HA monoclonal antibodies recognized two protein bands with different molecu- 
lar masses (mass difference: 1-2 kDa) after SDS-PAGE fractionation of thylakoids from the 
Lhcbl -HA-tagged algal strain. Immunoblot experiments with 2-DE separated Lhcbl -HA- 
tagged thylakoids using anti-Lhcbl peptide antibodies (directed against a sequence up- 
stream of the second processing site) and anti-HA antibodies revealed that the lower 
molecular weight spots recognized by the HA antibody are not recognized by the anti-Lhcb 1 
antibodies, demonstrating that processing occurs at the N-terminus. Thus, the Lhcb 2-D 
mapping results and the tagging experiment independently confirmed that two differen- 
tially N-terminally processed Lhcbl forms exist in vivo (A. Fink, E. Stauber, U. Johanning- 
meier, and M. Hippier, in prep.). 

A non-2-DE based proteomic analysis of integral membrane proteins of the 
chloroplast envelope has been established by combining extraction of 
hydrophobic proteins with organic solvents, SDS-PAGE separation and 
MS/MS (Ferro et al. 2002). This approach has led to the identification of 
altogether 54 proteins. 27 of them were newly identified envelope proteins, 
most of them having several F128a-helical transmembrane domains. Some 
of these integral proteins are most likely transporter proteins, which have 
not been predicted to be localized in the chloroplast envelope before. The 
identification of these new proteins together with proteins that were already 
known for their localization in the chloroplast inner envelope allowed to 
define features shared by these proteins. These common features were then 
used to generate a virtual database of integral membrane proteins in the 
Arabidopsis plastid envelope. This database will be an important tool for 
the further characterization of chloroplast envelope membrane proteins by 
biochemical and/or reverse genetic approaches. 

In addition to protein separation by 2-DE or SDS-PAGE, intact mem- 
brane proteins can also be extracted with organic solvents and separated by 
reverse-phase HPLC. Identification of chloroplast transmembrane proteins 
has been achieved by coupling liquid chromatography with electrospray- 
ionisation MS and MS/MS. This combination of techniques also succeeded 
in the characterization and identification of intact intrinsic thylakoid mem- 
brane proteins (Whitelegge et al. 1998; Corradini et al. 2000; Huber et al. 
2001; Gomez et al. 2002). 

All these proteomics analyses of chloroplast intrinsic membrane pro- 
teins demonstrate that, contrary to earlier belief, the hydrophobic protein 
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core of a membrane system can be dissected by systematic proteomics 
approaches. The technical progress that has come with these studies is likely 
to have general implications for research on biological membranes. 



5 Summary and Outlook 

With recent fascinating progress in protein separation techniques and 
mass spectrometry methods, the global analysis of prokaryotic proteomes 
and eukaryotic subproteomes has now become feasible. Chloroplast sub- 
proteomes have provided excellent model systems to improve proteomics 
methods and refine bioinformatics tools for large-scale protein data analy- 
sis. Clearly, it will be important to combine the structural analysis of 
proteomes with appropriate functional studies. At the same time, the 
increasing identification of novel protein isoforms and post-translational 
modifications by proteomics approaches raises new questions about the 
functional significance of this protein heterogeneity. 

What comes next? It seems clear that a complete picture of the chloro- 
plast proteome (and any other cellular or compartmental proteome) cannot 
be achieved in the near future, especially since the limited sensitivity of 
protein detection after electrophoretic separation at present prevents the 
identification of lowly abundant proteins. It is for this reason that, for 
example, most regulator proteins of key processes in gene expression and 
metabolic control currently escape detection in proteomics studies. Novel 
and more sensitive methods in protein separation and mass spectrometry 
will have to be applied to plastid proteomics (see above). However, with the 
recent acceleration of proteomics research, it can be envisioned that even 
low abundant regulatory proteins can be identified in the near future 
facilitating the dissection of regulatory protein networks with proteomics 
tools. 

Very obviously, the proteome of a given organism or compartment is 
more than the sum of the parts and by no means a static state of affairs. 
Although some information has been acquired on the protein set of the 
chloroplast, virtually nothing is known about the proteome of chro- 
moplasts, proplastids or amyloplasts. Also, how does the chloroplast pro- 
teome change during plant development or in response to changing envi- 
ronmental conditions (e.g. upon exposure of the plant to biotic or abiotic 
stresses)? These questions lead into a field which maybe called comparative 
proteomics. Here, plastids again provide an excellent model system since a 
large number of differentiation forms, developmental stages and environ- 
mental conditions can be compared for a reasonably small proteome. 
Moreover, all the many chloroplast mutants in photosynthesis and plastid 
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gene expression that have been isolated or experimentally generated (Bock 
and Hippier 2002) over the past decades are likely to become attractive 
targets of future comparative proteomics studies. 

In summary, we are still a long way from understanding the full com- 
plexity and dynamics of even a relatively small subproteome as the one of 
the chloroplast. Last, but not least, it needs to be borne in mind that 
proteomics, just like any other type of large-scale data collection, ultimately 
must lead to the generation of new knowledge: Huge amounts of collected 
data alone do not mean much. They must be converted into understanding 
- a challenge which goes far beyond collecting information. 



References 

Abdallah F, Salamini F, Leister D (2000) A prediction of the size and evolutionary origin of 
the proteome of chloroplasts of Arabidopsis. Trends Plant Sci 5:141-142 

Bock R, Hippier M (2002) Extranuclear inheritance: functional genomics in chloroplasts. 
Progress in botany 63. Springer, Berlin Heidelberg New York, pp 106-131 

Corradini D, Huber CG, Timperio AM, Zolla L (2000) Resolution and identification of the 
protein components of the photosystem II antenna system of higher plants by reversed- 
phase liquid chromatography with electrospray-mass spectrometric detection. J Chro- 
matogr A 886:111-121 

Eng J, McCormack AL, Yates JR (1994) An approach to correlate tandem mass spectral data 
of peptides with amino acid sequences in a protein database. J Am Soc Mass Spectrom 
5:976-989 

Ferro M, Salvi D, Riviere-Rolland H, Vermat T, Seigneurin-Berny D, Grunwald D, Garin J, 
Joyard J, Rolland N (2002) Integral membrane proteins of the chloroplast envelope: 
identification and subcellular localization of new transporters. Proc Natl Acad Sci USA 
99:11487-11492 

Gavin AC, Bosche M, Krause R, Grandi P et al. (2002) Functional organization of the yeast 
proteome by systematic analysis of protein complexes. Nature 415:141-147 

Gomez SM, Nishio JN, Faull KF, Whitelegge JP (2002) The chloroplast grana proteome 
defined by intact mass measurements from liquid chromatography mass spectrometry. 
Mol Cell Proteomics 1:46-59 

Gygi SP, Rist B, Gerber SA, Turecek F, Gelb MH, Aebersold R (1999) Quantitative analysis 
of complex protein mixtures using isotope-coded affinity tags. Nat Biotechnol 
17:994-999 

Hippier M, Klein J, Fink A, Allinger T, Hoerth P (2001) Towards functional proteomics of 
membrane protein complexes: analysis of thylakoid membranes from Chlamydomonas 
reinhardtii. Plant J. 28:595-606 

Huber CG, Timperio AM, Zolla L (2001) Isoforms of photosystem II antenna proteins in 
different plant species revealed by liquid chromatography-electrospray ionization mass 
spectrometry. J Biol Chem 276:45755-45761 

Imbault P, Wittemer C, Johanningmeier U, Jacobs JD, Howell SH (1988) Structure of the 
Chlamydomonas reinhardtii cabll-l gene encoding a chlorophyll-a/b-binding protein. 
Gene 73:397-407 

Kaneko T, Tabata S (1997) Complete genome structure of the unicellular cyanobacterium 
Synechocystis sp. PCC6803. Plant Cell Physiol 38:1171-1176 




104 



Genetics 



Kaneko T, Sato S, Kotani H, Tanaka A, Asamizu E, Nakamura Y, Miyajima N, Hirosawa M, 
Sugiura M, Sasamoto S, Kimura T, Hosouchi T, Matsuno A, Muraki A, Nakazaki N, Naruo 
K, Okumura S, Shimpo S, Takeuchi C, Wada T, Watanabe A, Yamada M, Yasuda M, 
Tabata S (1996) Sequence analysis of the genome of the unicellular cyanobacterium 
Synechocystis sp. strain PCC6803. II. Sequence determination of the entire genome and 
assignment of potential protein-coding regions. DNA Res 3:109-136 
Klose J (1999) Genotypes and phenotypes. Electrophoresis 20:643-652 
Roller A, Washburn MP, Lange BM, Andon NL, Deciu C, Haynes PA, Hays L, Schieltz D, 
Ulaszek R, Wei J, Wolters D, Yates JR 3rd (2002) Proteomic survey of metabolic pathways 
in rice. Proc Natl Acad Sci USA 99:11969-11974 
Kotani H, Tabata S (1998) Lessons from sequencing of the genome of a unicellular cyano- 
bacterium, Synechocystis SP. PCC6803. Annu Rev Plant Physiol Plant Mol Biol 
49:151-171 

Link AJ, Eng J, Schieltz DM, Carmack E, Mize GJ, Morris DR, Garvik BM, Y ates JR 3rd ( 1 999) 
Direct analysis of protein complexes using mass spectrometry. Nat Biotechnol 
17:676-682 

Mann M, Wilm M (1994) Error-tolerant identification of peptides in sequence databases by 
peptide sequence tags. Anal Chem 66:4390-4399 
Martin W, Rujan T, Richly E, Hansen A, Cornelsen S, Lins T, Leister D, Stoebe B, Hasegawa 
M, Penny D (2002) Evolutionary analysis of Arabidopsis, cyanobacterial, and chloroplast 
genomes reveals plastid phylogeny and thousands of cyanobacterial genes in the nucleus. 
Proc Natl Acad Sci USA 99:12246-12251 

Moseley JL, Allinger T, Herzog S, Hoerth P, Wehinger E, Merchant S, Hippier M (2002) 
Adaptation to Fe-deficiency requires re-modelling of the photosynthetic apparatus. 
EMBOJ 21:6709-6720 

O’Farrell P (1975) High resolution two-dimensional electrophoresis of proteins. J Biol Chem 
250:4007-4021 

Peltier JB, Friso G, Kalume DE, Roepstorff P, Nilsson F, Adamska I, van Wijk KJ (2000) 
Proteomics of the chloroplast: systematic identification and targeting analysis oflumenal 
and peripheral thylakoid proteins. Plant Cell 12:319-341 
Peltier JB, Ytterberg J, Liberies DA, Roepstorff P, van Wijk KJ (2001) Identification of a 
350-kDa ClpP protease complex with 10 different Clp isoforms in chloroplasts oiArabi- 
dopsis thaliana. J Biol Chem 276:16318-16327 
Peltier JB, Emanuelsson O, Kalume DE, Ytterberg J, Friso G, Rudella A, Liberies DA, 
Soderberg L, Roepstorff P, von Heijne G, van Wijk KJ (2002) Central functions of the 
lumenal and peripheral thylakoid proteome of Arabidopsis determined by experimen- 
tation and genome-wide prediction. Plant Cell 14:211-236 
Peng J, Gygi SP (2001) Proteomics: the move to mixtures. J Mass Spectrom 36:1083-1091 
Perkins DN, Pappin DJ, Creasy DM, Cottrell JS (1999) Probability-based protein identifica- 
tion by searching sequence databases using mass spectrometry data. Electrophoresis 
20:3551-3367 

Puig O, Caspary F, Rigaut G, Rutz B, Bouveret E, Bragado-Nilsson E, Wilm M, Seraphin B 
(2001) The tandem affinity purification (TAP) method: a general procedure of protein 
complex purification. Methods 24:218-229 

Rujan T, Martin W (2001) How many genes in Arabidopsis come from cyanobacteria ? An 
estimate from 386 protein phylogenies. Trends Genet 17:113-121 
Santoni V, Molloy M, Rabilloud T (2000) Membrane proteins and proteomics: un amour 
impossible? Electrophoresis 21:1054-1070 

Schubert M, Petersson UA, Haas BJ, Funk C, Schroder WP, Kieselbach T (2002) Proteome 
map of the chloroplast lumen of Arabidopsis thaliana. J Biol Chem 277:8354-8365 
The Arabidopsis Genome Initiative (2000) Analysis of the genome sequence of the flowering 
plant Arabidopsis thaliana. Nature 408:796-815 
van Wijk KJ (2000) Proteomics of the chloroplast: experimentation and prediction. Trends 
Plant Sci 5:420-425 




Extranuclear Inheritance 



105 



Washburn MP, Wolters D, Yates JR 3rd (2001) Large-scale analysis of the yeast proteome 
by multidimensional protein identification technology. Nat Biotechnol 19:242-247 
Whitelegge JP, Gundersen CB, Faull KF (1998) Electrospray- ionization mass spectrometry 
of intact intrinsic membrane proteins. Protein Sci 7:1423-1430 
Yamaguchi K, Subramanian AR (2000) The plastid ribosomal proteins. Identification of all 
the proteins in the 50S subunit of an organelle ribosome (chloroplast). J Biol Chem 
275:28466-28482 

Yamaguchi K, von Knoblauch K, Subramanian AR (2000) The plastid ribosomal proteins. 
Identification of all the proteins in the 30S subunit of an organelle ribosome (chloro- 
plast). J Biol Chem 275:28455-28465 

PD Dr. Michael Hippier 
Friedrich-Schiller-Universitat Jena 
Institut fur Allgemeine Botanik 
Dornburger Strasse 159 
07743 Jena, Germany 
Tel: +49-3641-949237 
Fax: +49-3641-949232 
e-mail: m.hippler@uni-jena.de 

Prof. Dr. Ralph Bock 

Westfalische Wilhelms-Universitat Munster 
Institut fur Biochemie und 
Biotechnologie der Pflanzen 
Hindenburgplatz 55 
48143 Munster, Germany 
Tel: +49-251-83-24790/91 
Communicated by Fax: +49-25 1 -8328371 

K. Esser e-mail: rbock@uni-muenster.de 




Genetics 



Molecular Cell Biology: 

Organization and Molecular Evolution of rDNA, 
Nucleolar Dominance, and Nucleolus Structure 

Roman A. Volkov, Francisco Javier Medina, Ulrike Zentgraf, 
and Vera Hemleben 



1 Introduction 

Recently, the ribosomes - molecularly well-characterized cell organelles 
where the process of translation of messenger RNA into polypeptides 
occurs - have attracted new interest since they can be considered as a huge 
ribozyme-like complex consisting of different proteins and RNA - the 
RNAs mainly fulfilling functional tasks in translation, the proteins serving 
more structural functions (Moore and Steitz 2002). Within the eukaryotic 
cell, two (animals and fungi) or three (plants) sites of a translational 
machinery are present, the cytoplasm, mitochondria and chloroplasts, and, 
respectively, the components are mostly encoded in the cell nucleus or the 
mitochondrial or chloroplast genome. Since new insights are gained into 
the production and the assembly of the ribosome components, ribosomal 
RNA (rRNA) and ribosomal proteins (r-proteins), in this chapter, we focus 
on the nucleolus, the site for cytoplasmic ribosome biogenesis and assembly 
occurring in the light-microscopically visible structure of the cell nucleus. 
Similarly as in prokaryotes, for all higher organisms two ribosome subunits 
are preformed in the nucleolus, which are evolutionary conserved: the small 
40S subunit (SSU) containing 18S rRNA, and the large 60S subunit contain- 
ing 25/28S plus 5.8S and 5S rRNA. After export of the ribosome subunits 
(LSU) into the cytoplasm, the functionally active ribosome is associated 
with translatable mRNA to an 80S complex corresponding to the 70S 
structure in chloroplasts and prokaryotic cells. 

Following the ribozyme concept, ribosome subunits are built up by 
rRNA and r-proteins. The nuclear encoded rRNA genes (rDNA) are present 
as a multigene family in the genome, mostly located at the NOR (nucleo- 
lus-organizing region). In the transition area from fibrillar centers to the 
dense fibrillar component (FC/DFC) of the nucleolus, the 18S, 5.8S and 
25S/28S rRNA are commonly transcribed by RNA polymerase I (pol I) as a 
large precursor, which in plants is approx. 35S in size (35S pre-rRNA); this 
35S pre-rRNA is then stepwisely processed into the respective rRNAs. RNA 
polymerase III (pol III) transcribes the 5S rRNA component encoded by 5S 
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rDNA located at other regions of the genome for most higher organisms. 
For processing of the pre-rRNA, small nucleolar RNAs (snoRNAs) genes 
are transcribed by either RNA polymerase II (pol II) or pol III. For methy- 
lation of rRNA and structural functions, the mRNA for enzymatic or 
structural proteins supporting the processing as snoRNPs (small nucleolar 
RNA/protein complexes) is produced by pol II. During the process of 
ribosome biogenesis, the rRNAs are associated with r-proteins to pre-ribo- 
somes. Ribosomal proteins are encoded in the nuclear genome and tran- 
scribed by pol II into mRNA; this mRNA is translated into r-proteins on 
cytoplasmic ribosomes, which are mostly retransported to the nucleolus, 
where assembly of pre-ribosomes takes place, thus forming the granular 
component (GC). Therefore, we can expect a coordinated cross-talk not 
only between the different transcriptional and translational machineries 
between the cell nucleus and the cytoplasm, but also among the different 
cell organelles. 

From extensive molecular studies of plant rRNA genes, we now have information on the 
structural organization of the multigene families, often comprising between 500 and more 
than 30,000 members, depending on the plant species (Hemleben et al. 1988), and their use 
for evolutionary and phylogenetic studies became evident. However, our knowledge on the 
regulation of transcription by specific factors and processing for animal and yeast cells is 
by far more detailed than for plant ribosomal genes. 

Ribosomes can be considered as basic organelles of the cell; their compo- 
nents have been generally termed as fulfilling “house-keeping” functions 
and the biosynthesis of the respective components as occurring constitu- 
tively. Recently, more and more information is obtained on their highly 
regulated and finely tuned expression realized by transcriptional enhance- 
ment or by “epigenetic” processes like gene silencing. Especially for higher 
plants, where interspecific hybridization accompanied by polyploidization 
occurs in nature, the phenomenon of “nucleolar dominance”, defined as 
active transcription of one partner rDNA and silencing of the respective 
other partner rDNA (Pikaard 2000), allows more insight into the regulation 
processes. 

The new field of proteomics revealed more than 230 nucleolar proteins 
for humans and yeast, but up to now fewer than 100 nucleolar proteins have 
been identified in functional terms. Therefore, a significant proportion of 
these proteins are suggested to be involved in functions beyond the known 
role of the nucleolus in ribosome biogenesis (Andersen et al. 2002; Scherl 
et al. 2002), supporting the idea of a “plurifunctional nucleolus” (Pederson 
1998). 

Especially for higher plants, in the last decade, specific proteins gained much interest as 
therapeutic agents acting as anticancer or antiviral compounds: Ribosome-inactivating 
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proteins (RIPs) are potent plant toxins with RNA N-glycosidase activity. They are strong 
inhibitors of eukaryotic protein synthesis that inactivate eukaryotic ribosomes by cleaving 
the N-glycosidic bond of a specific adenylate (A) residue in the 25/28S rRNA at the 
oc-sarcin/ricin loop. Type 1 and type 2 RIPs were characterized for various plant species. 

Here, we review and summarize results on the organization and transcrip- 
tional regulation of the nuclear-encoded rRNA genes and their use for 
evolutionary and phylogenetic studies, and emphasize nucleolar proteins, 
the nucleolus structure, and ribosome assembly, in order to supplement 
recent reviews on rDNA regulation and function (Hemleben and Zentgraf 
1994; Zentgraf et al. 1998). 



2 Structural Organization and Molecular Evolution of rDNA 
2.1 Organization of 35S rDNA 

The nuclear genes coding for the 18S, 5.8S, and 25/28S rRNA, belonging to 
the class of repeated sequences, are arranged in head-to-tail tandem arrays 
(with only rare exceptions) and located in one or more places in the genome. 
The transcribed genes, which form the NOR, can be cytologically distin- 
guished as secondary constrictions (SC) in mitosis/meiosis or as nucleolus 
in interphase nuclei (Leweke and Hemleben 1982; Hemleben and Zentgraf 
1994; Moss and Stefanovsky 1995). A rDNA repeat unit is composed of the 
rRNA coding regions, external and internal transcribed spacers (ETS and 
ITS), and non-transcribed spacer (NTS) regions (Fig. 1). The order of the 
coding regions is universal in all eukaryotes. Additionally, in rRNA-coding 
regions inserted sequences were described for insects (England et al. 1988; 
Bigot et al. 1992) and green algae (Wilcox et al. 1992). 

Different portions of the rDNA repeat unit evolve with different rates. The coding regions 
represent one of the most conservative sequences in eukaryotes (Lipscomb et al. 1998), 
which is obviously a result of a strong selection against any loss-of-function mutations in 
components of the ribosome subunits (Caetano-Anolles 2002; Moore and Steitz 2002). The 
most conservative part appears to be the 3’ end of 25S rDNA representing the a-sarcin/ricin 
(S/R) loop (see below). Similarly, a conservation of some sequence motifs and/or secondary 
structures in other subregions of rDNA/rRNA indicates a functional significance. For 
instance, sequence motifs conserved in the ITS seem to be necessary for interaction with the 
proteins involved in processing of the 35S pre-rRNA (Torres et al. 1990; Liu and Schardl 
1994; Mai and Coleman 1997). 

In contrast to the coding regions, the intergenic spacer (IGS) region evolves rather 
quickly; nearly no sequence similarity can be found in this region for members of different 
plant families (Hemleben and Zentgraf 1994). An exception represents a sequence motif for 
the transcription initiation site (TIS), TATATA(A/G)GGG in dicots and TATAG- 
TAGGG(A/G) in monocots (Perry and Palukaitis 1990; Zentgraf et al. 1990; Doelling and 
Pikaard 1995; Volkov et al. 1996; Akhunov et al. 2001). Another comparatively conservative 
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portion of the IGS is a region adjacent to the 18S rDNA in different families of dicots (King 
et al. 1993; Volkov et al. 1999b). These motifs could play a role in pre-rRNA processing. 

A specific feature of the IGS is not only a higher rate of base substitutions, but also a 
remarkable length variability (Rogers and Bendich 1987a; Hemleben et al. 1988). Several 
length variants of IGS can be present in the same genome (Hemleben et al. 1988; Zentgraf 
et al. 1990). Species of the same genus often differ significantly in the length of the IGS: e.g. 
from 1.7 to 6.4 kb in Trillium (Yakura et al. 1983), from 2.6 to 6.5 kb in Prunus (Volkov et 
al. 1993) or from 3.4 to 5.9 kb in Nicotiana (Borisjuk et al. 1997). Moreover, even for 
individuals of the same population, the length of the IGS can differ (Learn et al. 1987; Rogers 
and Bendich 1987a). For instance, high variability of IGS was found for several species of 
family Fabaceae: Viciafaba (Kato et al. 1985; Rogers and Bendich 1987b), Pisum sativum 
(Polans et al. 1986), and Phaseolus (Gerstner et al. 1988; Schiebel et al. 1989; Maggini et al. 
1992). However, there are also examples of low variability or even uniformity of IGS length 
among species of the same genus (Choumane and Heizmann 1988; Santoni and Bervielle 
1992; Borisjuk et al. 1994). The degree of IGS variability could be connected with character 
of pollination: In the genus Hordeum , cross-pollinated species mainly possess one length 
variant of the IGS, whereas self-pollinated species show intragenomic polymorphisms 
(Molnar et al 1989); however, such a correlation was not found for Clematis or Rudbeckia 
(Learn et al. 1987; King and Schaal 1989). 

The length variability of IGS is mainly due to amplification of different 
subregions that results in the appearance of subrepeated elements within 
the IGS (see Fig. 1; Hemleben and Zentgraf 1994). 

Normally, several classes of short subrepeated elements are present upstream of TIS (e.g. 
Kato etal. 1985; Kelly and Siegel 1989;Delsenyetal. 1990; Perry and Palukaitis 1990;Grtindler 
et al. 1991; Borisjuk and Hemleben 1993; King et al. 1993; Volkov et al. 1996; Borisjuk et al. 
1997). This portion of IGS accumulates base substitutions with a high rate: obvious sequence 
similarity can be found here only among members of the same genus or closely related 
genera. In contrast, downstream of TIS, subrepeats are not always present. Downstream 
subrepeats accumulate base substitutions more slowly. Interestingly, different sequences 
were independently amplified downstream of TIS in Nicotiana and Lycopersicon , producing 
subrepeats of nearly the same size, 140-145 bp (Volkov et al. 1996; 2003; Borisjuk et al. 1997). 
Amplification of subrepeats in IGS occurred several times during evolution. For instance, 
in the genus Nicotiana , original duplication of the ancestral A-subrepeat sequence in the 
ETS took place before the divergence of subgenera, and produced two sub variants, Al and 
A2. Additional amplification during later speciation of Nicotiana formed longer stretches 
of Al/A2-subrepeats independently in different species (Volkov et al. 1996). Similarly, 
several rounds of amplification/deletions generated a “chaotic mix” of C-subrepeats up- 
stream of TIS (Volkov et al. 1999a). 

In spite of length and sequence variability, the general arrangement of IGS 
often remained unaltered among members of the same family. Comparison 
of IGS organization in distantly related Solanaceae, Lycopersicon esculen- 
tum (Perry and Palukaitis 1990), Solanum tuberosum (Borisjuk and Hem- 
leben 1993), Atropa belladonna (Volkov et al. 1997), and three species of 
Nicotiana (Borisjuk et al. 1997; Volkov et al. 1999a), shows that all these 
species possess (1) a unique AT-rich sequence in the central part of IGS, 
immediately upstream of TIS, (2) subrepeated elements upstream of TIS, 
and (3) similar localization of stem-loop structures (Fig. 1). 
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Fig. 1. Structural organization of 35S rDNA. A Schematic presentation of structural organi- 
zation of rDNA, expression and processing. B Organization and sequence similarity of the 
intergenic spacers (IGS) of rDNA of different Solanaceae: Ntm , N. tomentosiformis ; Nbt, N. 
tabacum ; Ns, N. sylvestris; Ab, Atropa belladonna; St, S. tuberosum; Le, L. esculentum. 
Percent of similarity for different IGS subregiones are given. TIS Transcription initiation 
site; TTS transcription termination site; ETS external transcribed spacer; ITS internal 
transcribed spacer; NTS non-transcribed spacer 
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2.2 Organization of 5S rDNA 

In eukaryotes, the 5S rDNA also belongs to the repeated sequence classes 
and is arranged in head-to-tail tandem arrays. Usually, 5S rDNA and 18-25S 
rDNA are located at different locations of the same chromosome or often 
at different chromosomes. The internal structural organization of 5S rDNA 
repeated unit is comparatively simple: Each repeat is composed of a con- 
served, approx. 120-bp-long coding region, and of a more variable inter- 
genic spacer region (5S SR) (Hemleben and Werts 1988; Ellis et al. 1988; 
Scoles et al. 1988; Lapitan et al. 1991). 

The high conservation of the 5S rRNA coding region exhibits some family- specificity, 
especially at the 5’end: 5S rRNA genes begin with GGA in Solanaceae (Frasch et al. 1989; 
Barciszewska et al. 1994; Volkov et al. 2001), with AGG in Fabaceae, and GGG in Brassicaceae 
(Hemleben and Werts 1988; Barciszewska et al. 1994) and in Poaceae (Van Campenhout et 
al. 1998; Roser et al. 2001). The 5S SR can be subdivided into three subregions: 3’ and 5* 
flanking sequences (3* and 5’ FS) and middle variable region (VR). In comparison to VR, 
both 3’ and 5’ FS are more conservative, suggesting presumptive functional significance 
(Hemleben and Werts 1988; Van Campenhout et al. 1998; Crisp et al. 1999; Trontin et al. 
1999; Roser et al. 2001; Volkov et al. 2001). In the VR, two further portions can be distin- 
guished: (1) the AT-rich and (2) the subrepeated regions. Rearrangements in VR are 
preferentially associated with the repeated motifs. 



2.3 Molecular Evolution and Taxonomic Applications of rDNA 

Concerted evolution, which enhances sequence similarity between mem- 
bers of multigene families, is a specific feature of repeated genome ele- 
ments, including both 5S and 35S rDNA (Coen et al. 1982; Dover and Flavell 
1984; Dvorak et al. 1987; Hemleben et al. 2000; Ganley and Scott 2002). A 
new mutation appearing in a single repeat unit has to be either quickly 
eliminated from the genome or distributed through all other repeats of the 
family. Although numerous copies of rDNA are present in the genome, 
concerted evolution often results nearly in identity of individual repeats of 
35S (Volkov et al. 1996, 1999a; Linder et al. 2000) and 5S rDNA (Volkov et 
al. 2001). 

Detailed comparison of numerous ETS clones isolated for 30 Solanum species (Volkov et al. 
2003) demonstrated an unusually high level of identity: More than 99% sequence similarity 
was found for ten diploid species. For two allopolyploids, intergenomic sequence similarity 
was lower (93.5-97.7%), demonstrating that homogenization of rDNA inherited from 
parental species may take time. Remarkably, in two other polyploids, the rDNA repeats 
appear to be very homogeneous: 99.1-99.6%, considering the following reasons: (1) the 
polyploids appeared as a result of hybridization between closely related diploids; (2) the 
rDNA introduced by one of the parents was eliminated from the genome, similar to the 
situation described for the allotetraploid N. tabacum (Volkov et al. 1999a; see also below). 
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Efficiency of homogenization by concerted evolution depends on the local 
distance between repeated units (Dvorak et al. 1987). Particularly, 35S 
rDNA repeats within the same NOR are more similar to one another than 
repeats from different NORs, as it was originally shown for Drosophila 
(Schlotter and Tautz 1994). In Arabidopsis thaliana , a common IGS length 
variant was found among 35S rDNA repeats at NOR2 located on chromo- 
some 2, whereas NOR4 on chromosome 4 contains three length variants 
(Copenhaver and Pikaard 1996). The NOR4 variants are not intermingled 
with one another, but highly clustered, suggesting that in the concerted 
evolution of rRNA genes homogenization is a consequence of local spread- 
ing of new rDNA variants. On the other hand, interspersion of 5S repeats 
of different length was observed in pea (Ellis et al. 1988) and in Larix 
(Trontin et al. 1999), suggesting different efficiency of concerted evolution 
of different repeated sequences. 

Homogenization takes place not only between the complete copies of 35S rDNA repeats, but 
also between the subrepeats within the same repeat unit. Remarkably, the border subrepeat 
units are more divergent from the corresponding consensus sequence than the central ones, 
indicating that the homogenization process is more effective in the central portion of 
subrepeated region (Barker et al. 1989; Schiebel et al. 1989; Volkov et al. 1996). These 
observations agree with calculations derived from computer modeling of tandem arrays 
(Dvorak et al. 1987) and support the assumption that unequal crossing over is involved in 
the homogenization of repeated sequences. 

Due to the existence of regions evolving with different rates, 35S rDNA 
represents a very attractive tool for molecular taxonomy. In addition, due 
to the higher similarity of individual repeat units found not only in coding, 
but also in spacer regions, it is often not necessary to sequence many 
individual clones; even application of direct sequencing could produce 
correct results. Coding regions with the lowest rate of evolution were 
successfully used for phylogenetic studies of distantly related taxa (Lip- 
scomb et al. 1998), whereas comparison of more rapidly evolving ITS 1 and 
2 was widely applied for taxonomic reconstructions among members of the 
same or closely related genera, e.g. of Compositae (Baldwin 1992), Cucur- 
bitaceae (Jobst et al. 1998), Aveneae (Grebenstein et al. 1998), Phaseolus , 
Vigna (Goel et al. 2002), and many others. Again, in some groups, ITS does 
not appear to be phylogenetically informative. In such cases, comparison 
of ETS (Volkov et al. 1996, 2003; Baldwin and Markos 1998; Bena et al. 1998; 
Linder et al. 2000) can be used. In many plants, both ITS and ETS evolve 
preferentially by stepwise base substitutions allowing correct sequence 
alignments and successful phylogenetic reconstructions even for closely 
related species (Denk et al. 2002). However, specific indels in the 35S 5’ ETS 
can be used also as taxonomically relevant characters for separation of 
major taxonomic groups (Volkov et al. 2003). 
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Furthermore, 5S rDNA can be a valuable molecular marker for phyloge- 
netic reconstructions at different taxonomic levels, as widely used already 
for the coding region; but also the spacer region contains phylogenetic 
information for low-level taxonomy (Roser et al. 2001; Volkov et al. 2001; 
and above). 



3 Transcriptional Regulation of rDNA 

3.1 Promoter Structure and Transcription Initiation of 35S rDNA 

Transcription of 35S rDNA is specifically carried out by pol I, starting from 
the TIS in the IGS, synthesizing the 35S pre-rRNA, and finishing at the 
transcription termination site (TTS). Hence, the rRNA precursor contains 
not only sequences of 18S, 5.8S, and 25-28S rRNA, but also of 5’ and 3’ ETS 
and of two ITS, which are removed in several subsequent steps during the 
processing (see Fig. 1; reviewed by Zentgraf et al. 1998). 

In situ immunolocalization methods detected pol I in the nucleolar FC, and, in the onion 
nucleolus, in the transition area between FC and the DFC as well as in the mitotic secondary 
constriction of chromosomes, which corresponds to the NOR (Scheer and Rose 1984; Martin 
and Medina 1991; Gilbert et al. 1995). Interestingly, although the enzyme was unequivocally 
found to be rDNA-bound, this fact was not necessarily associated with its current involve- 
ment in active transcription (Scheer and Rose 1984; Weisenberger and Scheer 1995). 

In different eukaryotic species, pol I was purified as a multimeric complex formed by 
many subunits (Sentenac 1985). In the yeast Saccharomyces cerevisiae , 14 proteins ranging 
in mass from 8.3 to 186 kDa were found, 7 of which are specific for pol I, whereas the other 
7 represent components common also for pol II and/or pol III (Carles and Riva 1998). 
Phosphorylation of several subunits and binding of zinc seem to be involved in activity 
regulation and stabilization of pol I. Probably, three core subunits, A 190, A 135 and AC40, 
which are structurally and functionally related to the eubacterial RNA polymerase core 
enzyme, provide the main basal functions, i.e. template-dependent polymerization itself, 
whereas regulatory and/or species-specific aspects are connected with the remaining non- 
core subunits. In contrast to yeast and animals, much less is known about pol I in plants: 
Earlier, the chromatin-bound enzyme was isolated from cauliflower inflorescence (Guilfoyle 
1980), and a putative pol I holoenzyme complex was purified from broccoli, Brassica 
oleracea. This cell-free system was able to support promoter-dependent pol I transcription 
in vitro (Saez-Vasquez and Pikaard 1997). The complex is composed of more than 30 
polypeptides, including protein kinase, histone acetyltransferase and topoisomerase (Albert 
et al. 1999; Hannan et al. 1999; Saez-Vasquez et al. 2001). 

It is known for animals and yeast that pol I requires additional transcription 
initiation factors (TIFs) for its function and several transcription-associ- 
ated factors (TAFs) for interaction with the promoter. The rDNA promoter 
region consists of two functional subregions: core promoter (CP, from 
about -40 to +10 bp) and upstream element (UE, up to -150 bp). The only 
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well-studied species in which UE has not been identified is Acanthamoeba 
castellanii , but this may be because the CP in this species is too strong to 
allow detection of UE in vitro (Paule 1998a, b). Although UE is not abso- 
lutely required for transcription initiation, it stimulates the intensity and 
provides species-specificity of expression (Heix and Grummt 1995; for 
review see: Moss and Stefanowsky 1995; Paule 1998a; Zentgraf et al. 1998; 
Grummt 1999). Direct binding of protein factors to UE seems to be the 
initial step of preinitiation complex formation for pol I, whereas several 
other TAFs are subsequently recruited to the promoter (Schnapp and 
Grummt 1991; Comai et al. 1992; Moss and Stefanowsky 1995; Steffan et al. 
1996; Grummt 1999). Interestingly, in spite of a similar function, TAFs 
differ essentially in different taxonomic groups. In yeast, several proteins 
interact with UE, being substituted in vertebrates by a single large upstream 
binding factor (UBF), which contains multiple DNA binding domains 
known as HMG boxes due to their similarity to HMG non-histone chromo- 
somal proteins (Reeder 1990; Schnapp and Grummt 1991; Zentgraf et al. 
1998; Grummt 1999). 

In contrast, little is known about the rDNA promoter structure and TAFs 
in plants (for review see Hemleben and Zentgraf 1994; Zentgraf et al 1998). 
In A. thaliana , a sequence between -55 and -33 bp upstream and +6 bp 
downstream of TIS was found to be sufficient for accurate pol I transcrip- 
tion; no domains equivalent to UE were found (Doelling and Pikaard 1995, 
1996; Saez-Vasquez and Pikaard 1997). The authors speculate that plants 
may have simple rRNA promoters equivalent to the core promoters in 
non-plant systems (Pikaard 2002). Pol I could exist as a multiprotein 
holoenzyme, which can associate with the promoter in a single DNA bind- 
ing event (Saez-Vasquez and Pikaard 2000). However, a longer promoter 
sequence, between -113 and +15 bp, was necessary to direct pol I-depend- 
ent transcription in Triticum species (Akhunov et al. 2001), suggesting the 
presence of UE also in plant pol I promoters. 

Similarly, it remains unclear if any homologues of yeast or animal TAFs are present in plants. 
Remarkably, autoantiserum against human UBF is capable of detecting the homologous 
protein in onion cells. The protein appeared to be preferentially located in FCs and on the 
transition area FC/DFC, but a significant amount was found in the DFC (Rodrigo et al. 1992; 
De Career and Medina 1999). However, it should be additionally clarified, if the respective 
plant protein represents a real homologue of mammalian UBF, or, alternatively, some other 
HMG protein, which cross-reacts with the antibody used. According to our preliminary 
results, at least five proteins, ranging in size from 34 to 120 kDa, interact with the promoter 
sequence from -165 to -97 bp upstream of TIS in tobacco (R.A. Volkov, V. Hemleben, 
unpubl.). 
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3.2 Modulation of rDNA Transcription 

Different subregions of IGS other than the promoter can be involved in the 
regulation of rDNA transcription. AT-rich regions found in the IGS of 
different plants upstream of TIS represent presumptive enhancers (Boris- 
juk et al. 2000); similar sequences also improve the transcription of protein 
coding genes (Raschke et al. 1988; Schoffl et al. 1993; Sandhu et al. 1998). 
Besides, similarly to Xenopus (Moss 1983; Reeder 1984; Pape et al. 1989; 
Mougey et al. 1996), upstream subrepeats in plants could also represent 
enhancers (Flavell 1986; Zentgraf et al. 1998). In transgenic Arabidopsis 
plants, 35S rDNA upstream subrepeats increased transcription of a reporter 
gene up to four times if cloned upstream of the 35S minimal promoter 
(Schlogelhofer et al. 2002). However, the question still remains unanswered 
because later experiments with Xenopus did not confirm the enhancer 
function of upstream subrepeats (Caudy and Pikaard 2002). The functional 
role of downstream subrepeats also remains mostly unclear, however, new 
data demonstrate a role in the control of nucleolar dominance in potato- 
tomato hybrids (N.Y. Komarova, T. Grabe, V. Hemleben, R.A. Volkov, 
unpubl.). 



3.3 Transcription Termination 

Termination of pol I requires the interaction of a specific DNA binding 
factor with a terminator element downstream of the 3’end of 26S rDNA. 
These elements and corresponding protein factors are different in yeast and 
vertebrates, but the transcription termination factor of yeast, Reblp, can 
function in vitro in combination with murine release factor, suggesting that 
the mechanism of pol I termination is highly conserved from yeast to 
mammals (Mason et al. 1997). In vivo analyses showed that in Schizosac- 
charomyces pombe the termination process strongly resembles the single 
element-mediated mechanism initially reported for mouse and is not de- 
pendent on an additional upstream sequence as first reported in yeast 
(Melekhovets et al. 1997). In the latter species, a stem-loop structure imme- 
diately downstream of the 25S rRNA region is necessary and sufficient for 
processing of the 3’ETS. Remarkably, the same stem loop is required for 
processing of ITS 1. This could be necessary to prevent processing of rRNA 
transcripts which are incomplete due to premature termination (Allmang 
and Tollervey 1998). 

Still, termination of transcription of 35S rDNA in plants is poorly described. Position of 
transcription termination downstream of the pre-rRNA coding region was determined for 




116 



Genetics 



several plants (for review see Hemleben and Zentgraf 1994). Although a region with 70% 
similarity to a terminator sequence in Xenopus rDNA was found near the termination signal 
in Vigna radiata (Schiebel et al. 1989), a consensus sequence for plants representing different 
families cannot be defined supposing a species specificity of rDNA transcription termina- 
tion. However, in all studied genera of the family Solanaceae, a motif GAGGTTTTT appeared 
to be strongly conserved, indicating a presumptive involvement in transcription termina- 
tion (Fig. 1). Additionally, termination of rRNA transcription could be controlled by a higher 
order stem-loop structure, as found in the IGS of Vigna radiata (Schiebel et al. 1989), 
Cucumis sativus (Zentgrafetal. 1990), and also in species of Solanaceae (Volkov etal. 1999b). 



3.4 Replication of 35S Ribosomal DNA 

and Amplification Promoting Sequences 

In eukaryotes, AT-rich sequence elements from IGS are not only involved 
in transcription initiation, but also participate in initiation of replication 
(Hernandez et al. 1988, 1993; Vanf Hof and Lamm 1992) and in amplifica- 
tion of rDNA repeats (Wegner et al. 1989; Hemann et al. 1994). Similarly, 
in AT -rich regions of potato and tobacco, a sequence element was identified 
which demonstrated a similarity to the muNTSl, an amplification-promot- 
ing sequence (APS) from the AT-rich region of mouse rDNA (Wegner et al. 
1989). A similar function was suggested for plants, because in transgenic 
tobacco such an element was able to increase the copy number of adjacent 
reporter genes (Borisjuk et al. 2000). Short sequence motifs 
(A/T)TTTAT(A/G)TTT(A/T) appeared to be conserved in the APS of to- 
bacco, potato, and mouse, and also in autonomous replication sequences 
(ARS) of yeast (Linkens and Hubermann 1988; Hernandez et al. 1993), 
indicating that amplification of rDNA repeats could function via repeated 
cycles of local replication (Hernandez et al. 1988; Borisjuk et al. 2000). 

Another function of AT-rich regions maybe an interaction with the nuclear matrix (Cock- 
erill and Garrard 1986; Gasser and Laemmli 1986; Boulikas 1993). Remarkably, the same 
function could have TG-rich regions, present in the NTS of Cucurbita maxima (Kelly and 
Siegel 1989; King et al. 1993) and Nicotiana species (Borisjuk et al. 1997; Volkov et al. 1999b) 
because GT-rich stretches, usually 6-12 bp long, were found in nuclear matrix- or scafford- 
attached regions (SARs or MARs) of several genes (Boulikas 1993). Interestingly, GT-rich 
regions are completely absent in the NTS of Solanum and Atropa (Fig. 1). Recently, electron 
microscopic studies using anti-DNA antibody confirm that the NTS represents the anchor- 
ing site of Allium cepa rDNA (Yano and Sato 2002). Hence, AT-rich regions in rDNA 
probably provide (1) transcription enhancement, (2) rDNA replication and amplification, 
and (3) interaction with nuclear matrix, three functions which could be at least partially 
facilitated by topoisomerase II, one of the major scaffold proteins. 
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3.5 Functional Organization of 5S rDNA 

In eukaryotes, 5S rDNA is transcribed by pol III which, as shown for animals 
and yeast, requires transcription factors TF IIIA, IIIB, and IIIC. TF IIIA, a 
zinc-finger protein, binds specifically to the internal control region (C-box) 
of 5S rRNA genes, which is followed by the sequential binding of TF IIIC, 
IIIB and pol III (Engelke et al. 1980; Pelham and Brown 1980; Ryan and 
Darby 1998). For plants, TF IIIA was purified and characterized from Tulipa 
whittaili, showing functional similarity of plant and amphibian factors 
(Wyszko and Barciszewska 1997); the factor-binding region appears also to 
be rather conserved (Hemleben and Werts 1988). In the 5S rDNA spacer 
some signals involved in transcription regulation are present as well: At the 
position -28 to -24 bp a conserved “TATA” box was found in distantly 
related Vigna radiata and Matthiola incana (Hemleben and Werts 1988). 
In Solanum, a similar TAATA motif was described at the same position, and 
in the 3’ FS of 5S SR a pyrimidine-rich stretch probably functioning as 
termination site was found (Volkov et al. 2001). In the VR of 5S SR two 
portions can be distinguished: (1) the AT-rich and (2) the subrepeated 
regions. The AT-rich region shows similarity to the amplification-promot- 
ing sequences (Borisjuk et al. 2000) and may be involved in amplification 
of 5S rDNA repeats. Similarly, AT-rich segments were also observed in the 
5S SR of Fabaceae (Hemleben and Werts 1988) and Poaceae (Roser et al. 
2001). A function for the subrepeated region is not clear yet; however, 
repeated motifs present here show obvious homology to the box C, an 
internal control region of 5S rRNA genes (Pieler et al. 1987; Hemleben and 
Werts 1988). 



4 Ribosomal DNA in Interspecific Hybrids 
4.1 Nucleolar Dominance 

A fascinating epigenetic phenomenon connected with the nuclear-encoded 
35S rDNA, termed “nucleolar dominance” (ND; Honjo and Reeder 1973), 
was originally described as “differential amphiplasty” by Navashin (1928, 
1934), who showed with cytological methods that in interspecific hybrids 
of Crepis only chromosomes of one crossing partner carry secondary con- 
strictions in metaphase and produce active nucleoli in interphase. Obvi- 
ously, the respective chromosomal regions were not lost in hybrids, but 
could be reactivated to produce normal nucleoli in hybrids with a different 
crossing partner. Based on these data, McClintock (1934) proposed that if 
the NOR of species A dominates over NOR of species B, and if B dominates 
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over C, then NOR of species A should dominate over C. The existence of 
such a hierarchy for Crepis species was later confirmed experimentally 
(Wallace and Langbridge 1971). ND was demonstrated also for many other 
organisms, both plants and animals (for review see: Reeder 1985; Pikaard 
2000 ). 

With respect to current knowledge about the molecular functions of 
NOR, a differential transcription/silencing of parental 35S rDNA in hybrids 
could be anticipated, as it was demonstrated for hybrids of Xenopus (Honjo 
and Reeder 1973). In order to explain ND at the molecular level, several 
hypotheses were proposed. According to the species-specific transcription 
factor hypothesis (Reeder 1974, 1985; Dover and Flavell 1984; Saghai-Ma- 
roof et al. 1984), a rapid molecular evolution of the rDNA promoter region 
should be accompanied by the co-evolution of protein transcription factors 
interacting with the promoter and/or one with another. If so, inactivation 
of a species-specific transcription factor in hybrids should result in ND. 
This idea was confirmed for human and murine 35S rDNA promoters, 
which are transcriptionally active only in the cell-free extracts prepared 
from the same species, a phenomenon provided by the species-specific 
transcription factor SL1 (Grummt et al. 1982; Mishima et al. 1982; Learnd 
et al. 1985). Similar specificity was also observed for plants: A tobacco rDNA 
promoter was not recognized in cell-free transcription extracts from bean 
(Fan et al. 1995), and tomato promoter remained inactive after transfection 
into protoplasts of Arabidopsis thaliana (Doelling and Pikaard 1996). How- 
ever, the hypothesis can not explain ND in hybrids of species which are 
sufficiently related to be crossable because in such species not only TIS, but 
also promoter regions upstream and downstream of it are as a rule highly 
homologous (King et al. 1993; Volkov et al. 1996, 2003; Chen and Pikaard 
1997a,b; Akhunov et al. 2001). Similarly, protein factors interacting with the 
promoter region of Solanum species that exhibit differential expression of 
parental rDNA by crossing seem to be nearly identical (N.Y. Komarova, V. 
Hemleben, R.A.Volkov, unpubl.). Furthermore, 35S rDNA promoters of 
Brassica and Arabidopsis species, demonstrating ND when hybridized, are 
fully functional if transfected into protoplasts of the other species (Chen et 
al. 1998; Frieman et al. 1999). 

Extensive investigation of ND in hybrids of wheat, rye and related genera of Triticeae 
revealed that rDNA repeats with longer IGS containing more subrepeated elements up- 
stream of TIS, dominate over those with shorter IGS with less upstream subrepeats (Flavell 
and O’Dell 1979; Martini et al. 1982; Gustafson et al. 1988; Vakhitov et al. 1989; Dvorak 1993; 
Houchins et al. 1997). These data agree well with the enhancer imbalance hypothesis 
originally proposed for hybrids of Xenopus , where upstream subrepeats act as enhancers 
stimulating transcription if cloned adjacent to the gene promoter (Busby and Reeder 1983; 
Moss 1983; Reeder 1984; Pikaard and Reeder 1988; Pape et al. 1989). Assuming that 
enhancers bind transcription factors present in limited amounts, ND could be due to the 
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competition for the factors between IGS of parental species (Reeder 1985; Flavell 1986). 
Selective methylation of under-dominant rDNA probably abolish the binding of transcrip- 
tion factors and results in differential NOR inactivation by chromatin condensation that is 
connected with histone deacetylation (Jasencakova et al. 2001) as shown for Triticum-Secale 
(Sardana et al. 1993; Houchins et al. 1997) and Brassica (Chen and Pikaard 1997a) hybrids. 
Nevertheless, in barley, differential expression of NORs present in wild type on chromo- 
somes 6 and 7 does not always correlate with rDNA methylation in translocation/deletion 
lines (Papazova et al. 2001). Furthermore, in contrast to Triticeae, data obtained for inter- 
specific hybrids of Brassica and Arabidopsis (Chen and Pikaard 1997a, b; Chen et al. 1998) 
were inconsistent with the enhancer imbalance hypothesis, because rDNA with longer 
spacers do not appear to be dominant in these species. However, for Brassica it is difficult 
to discuss the presumptive enhancer function of upstream subrepeats, because in the species 
studied upstream repetitive elements differ not only in number, but also in size and they 
belong to different types (Da Rocha and Bertrand 1995; Bhatia et al. 1996). 

Several observations demonstrate that ND is a phenomenon regulated at 
the chromosomal level. In wild type of barley, NORs on both chromosomes 
6 and 7 are active. If these NORs are placed together on the same chromo- 
some via translocation, NOR 6 dominates over NOR 7, whereas two copies 
of NOR 6 or two parts of splitted NOR 7 on the same chromosome are 
co-dominant (Nicoloff 1979; Georgiev et al. 2001). In hybrids of wheat and 
rye, Triticale , the rye NOR on the short arm of chromosome 1R, is normally 
suppressed. However, translocation of rye NOR on wheat chromosome 1, 
deletion of the long arm of chromosome 1R (Viera et al. 1990a,b) or 
substitution of chromosome 2R by the wheat chromosome 2D (Neves et al. 
1997a, b) result in the de-repression and co-dominance of rye NOR. There- 
fore, these cytological data obviously demonstrate that at least in Triticeae 
the chromosomal context of NOR can modulate ND. Application of mo- 
lecular methods for Brassica species showing ND (Chen and Pikaard 1997b) 
revealed no difference in transcription efficiency of respective rDNA, either 
in protoplast transfection assays or by using an in vitro transcription 
system (Saez-Vasquez and Pikaard 1997; Frieman et al. 1999), i.e. for 
appropriate regulation, the rDNA promoter should be integrated into a 
chromosome. 

A possible explanation of how the chromosomal context can modulate NOR activity could 
be that NORs are passively inactivated as a by-product of silencing of much larger chromo- 
somal segments, with silencing spreading to the NOR (Lewis and Pikaard 2001). In this case, 
genes adjacent to the NOR should be also silenced when ND occurs. To test this idea, 
Arabidopsis suecica, an allotraploid species originated by crossing diploids A. arenosa with 
A. thaliana , was used. In this species, only NORs inherited from A. arenosa are active (Chen 
et al. 1998). However, protein-coding genes adjacent to the inactive NOR 4 inherited from 
A. thaliana remain active in A. suecica (Lewis and Pikaard 2001), demonstrating the local 
nature of silencing and arguing against the possibility that silencing spreads to the NOR 
from the adjacent region. Hence, some structural features of rDNA, perhaps specific se- 
quences in the IGS, probably control differential expression/silencing of rDNA by ND. 




120 



Genetics 



4.2 Ribosomal DNA Rearrangement in Interspecific Hybrids 

Genomes of alloploid hybrids contain rDNA of both or of one of the 
parental species (Borisjuk and Miroshnichenko 1989; Delseny et al. 1990; 
Volkov et al. 1993). However, the structure of rDNA repeats of al- 
lotetraploid Nicotiana tabacum differs from that of the diploid parents, N. 
sylvestris and N. tomentosiformis (Borisjuk et al. 1989, 1997; Volkov et al. 
1991, 1996), although chromosomal localization of rDNA arrays seems to 
be similar in N. tabacum and in parental diploids (Kenton et al. 1993). 
Comparative sequence analysis showed that rDNA repeats of N. tabacum 
originate from N. tomentosiformis , but partial deletion of upstream subre- 
peats and additional amplification of downstream subrepeats occurred 
(Volkov etal. 1999a; see Fig. 1). Only about 8% of N. tabacum rDNA repeats 
remained from N. sylvestris , but they are located in a hypermethylated, 
inactive region of chromosome S12 (Lim et al. 2000). 

Several steps of molecular evolution of N. tabacum rDNA following allopolyploidization 
were proposed. ( 1 ) Nucleolar dominance: After hybridization of the parental diploid species, 
N. tomentosiformis rDNA, with its longer IGS, dominated over the N. sylvestris rDNA, with 
a shorter IGS. (2) Differential elimination/replacement: Functional inactivation of N. 
sylvestris rDNA was accompanied by the structural inactivation via stepwise elimination. 
The remaining arrays of sylvestris-like rDNA were substituted by tomentosiformis- like 
repeats via interloci conversion or/and translocation. (3) rDNA rearrangement: In tomen- 
tosiformis- originating rDNA, the subrepeated region upstream of TIS endured substantial 
reduction due to partial deletion of C-subrepeats (formation of the short variant of N. 
tabacum rDNA) and subsequently the subrepeated region downstream of TIS became longer 
due to amplification of the A-subrepeats (formation of the long variant of N. tabacum 
rDNA). Thus, 35S rDNA in allopolyploid genomes appears to be a target for essential 
molecular reconstruction, obviously confirming the idea of dynamic nature of polyploid 
genomes (Miroshnichenko et al. 1988; Soltis and Soltis 1995; Comai 2000; Wendel 2000). 
Remarkably, the higher rate of rearangement/interlocal homogenization seems to be a 
specific feature of 35S rDNA. So, different 5S rDNA classes originating from diploid pro- 
genitors can still be distinguished in allopolyploid Nicotiana tabacum (Fulnecek et al. 1998) 
and in hexaploid T. aestivum (Van Campenhout et al. 1998). 



5 Nucleolar Proteins and Structural-Functional Organization 
of the Nucleolus 

5.1 Nucleolar Proteins 

Under the group of “nucleolar proteins”, we usually refer to proteins which 
control different steps of ribosome biogenesis (rDNA transcription and 
pre-rRNA processing), but which are not present in the mature ribosome 
(Olson 1991). These proteins are, in turn, the targets of other factors 
involved in the control and regulation of other activities of the cell; thus the 




Molecular Cell Biology 



121 



role of nucleolar proteins is also to integrate ribosome biogenesis in the 
context of the concerted pattern of cellular activities. This is especially 
significant in proliferating cells, in which the nucleolar activity is recog- 
nized to be one of the best markers of cell proliferation and cell cycle 
progression by means of dynamic changes in the nucleolar structure, de- 
tectable in the different periods of interphase and in mitosis, e.g. by silver 
staining of the nucleolar organizer (Ag-NOR; Srivastava and Pollard 1999; 
Derenzini 2000; Medina et al. 2000). 

Recent studies using modern powerful proteomic techniques have re- 
solved the full catalogue of nucleolar proteins in the yeast (Pederson 2002) 
and human nucleolus. Two studies for human cells carried out on the same 
biological material did not arrive at the same results: The number of 
proteins detected was 271 in one case (Andersen et al. 2002), and 213 in 
another (Scherl et al. 2002). Probably, some proteins have been missed in 
each of the analyses, so that the true number of nucleolar proteins should 
be estimated at approx. 350. Less than 100 nucleolar proteins are identified 
in functional terms, but a significant proportion of these proteins are 
involved in functions beyond the known role of the nucleolus in ribosome 
biogenesis. This reinforces the idea of the “plurifunctional nucleolus” (Ped- 
erson 1998, 2002) and also indicates that the nucleolus is a dynamic organ- 
elle, not only from a morpho-functional point of view, but also regarding 
its protein composition, which might change according to the function and 
activity of the cell. Equivalent proteomic analyses have not yet been per- 
formed for the plant cell nucleolus, although the completion of the genomic 
sequence of Arabidopsis and rice and the current technological advance- 
ment (Thiellement et al. 1999) support expectations for a near achievement 
of such work. 

In plants, the isolation and characterization of plant homologues of nucleolar proteins, e.g. 
AtNAP57a homologue to rat NAP57 or yeast Cbf5p pseudouridine synthase, indicate a 
conservation of many nucleolar functions (Maceluch et al. 2001). By complementation of 
the L25 mutation in yeast, At RPL23A-1, the first plant member of the L23/L25 r-protein 
family, demonstrates functional equivalence of the two ribosomal proteins (McIntosh and 
Bonham-Smith 2001). The pea homologue to the DNA helicase I from human (PDH65) is 
recognized by the antibody against HDH I which was localized within the dense fibrillar 
component of the nucleolus; therefore, it is most likely involved in rDNA transcription and 
in the early stages of pre-RNA processing. In addition, it is stimulated by phosphorylation 
with CK2 and cdc2 protein kinases (Tuteja et al. 2001). 

Plants exhibit unique nucleolar features with respect to the snoRNAs (small 
nucleolar RNA) supporting processing of the 35S-rRNA. In contrast to 
vertebrates and yeast intron-encoded snoRNAs, which are processed from 
debranched introns by exonuclease activity, plants have a unique organi- 
zation of the snoRNA genes (Brown et al. 2001). The maize gene clusters 
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include genes from the two major classes of snoRNA: (1) box C/D snoRNA, 
which is considered to guide rRNA O-ribose methylations, and (2) box 
H/ACA snoRNA, thought to guide an rRNA pseudo-uridylation. These 
snoRNA gene clusters are transcribed from an upstream promoter as a 
polycistronic pre-snoRNA, which is then processed by a splicing-inde- 
pendent mechanism and gives rise to snoRNAs with distinct nucleolar 
localization; e.g. snoR2 and snoR3 are closely colocalized with the rDNA 
transcription sites, whereas snoRl shows a similar distribution to that of 
7-2/MRP, a snoRNA involved in the later pre-rRNA cleavage (Shaw et al. 
1998; Leader et al. 1999). Furthermore, a novel histone HI variant (p35) was 
isolated from the nucleoli of monocot as well as dicot plants. It strongly 
binds rDNA in vitro and is considered to be involved in the organization of 
nucleolar chromatin (Tanaka et al. 1999). 

Considering the nucleolar proteins which are functionally conserved in 
higher organisms and also well studied in plant cells, we will concentrate 
on fibrillarin and nucleolin. Fibrillarin was originally discovered through 
the study of the target of a human autoantiserum (Ochs et al. 1985). It is a 
basic protein, 34-38 kDa in size, highly conserved between species through- 
out evolution. The sequence of fibrillarin in yeast, plants, and vertebrates 
contains two motifs which are also present in other nucleolar and nuclear 
proteins. The first of them is the RNA recognition motif (RRM), also called 
consensus RNA-binding domain (CS-RBD), which is a stretch of 80-90 
amino acid residues containing two highly conserved sequences, the RNP- 1 
octapeptide and the RNP-2 hexapeptide; the second is the GAR (glycine- 
and arginine-rich) domain, a sequence formed by around 50% glycines and 
a substantial proportion of arginines, which are frequently in the form of 
dimethylarginine (Schimmang et al. 1989; Lapeyre et al. 1990; Pih et al. 
2000). This GAR domain seems to be responsible for the targeting of 
fibrillarin to the nucleolus (Pih et al. 2000). Interestingly, the fibrillarin gene 
characterized for Arabidopsis thaliana also encodes a novel small nucleolar 
RNA involved in rRNA methylation (Barneche et al. 2000). 

The most important functional feature of fibrillarin is the binding to snoRNAs of the C/D 
box group (U3, U8 and U13 snoRNAs) to form a snoRNP complex active in pre-rRNA 
processing (Baserga et al. 1991). Interaction of fibrillarin with U3 snoRNA is decisive for the 
first step of pre-rRNA processing, the cleavage of the ETS from the pre-rRNA (Kass et al. 
1990). This interaction and the participation of fibrillarin in the earliest steps of pre-rRNA 
processing have been visualized by immunocytochemical localization of the protein in the 
terminal balls of “Christmas trees”, the small globular structures located at the end of 
nascent pre-rRNA transcripts (Scheer and Benavente 1990; Mougey et al. 1993). Fibrillarin 
was named according to the localization in the nucleolar DFC (Ochs et al. 1985; Testillano 
et al. 1991). A detailed biochemical and quantitative immunolocalization study on plant 
fibrillarin in actively proliferating cells showed that two fractions of the protein could be 
identified, one soluble, active in pre-rRNA processing, and another insoluble form, associ- 
ated with the nucleolar matrix; furthermore, the in situ distribution of fibrillarin was shown 
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to be unequal throughout the massive DFC typical of onion root meristematic cells, being 
more concentrated in the transition area FC-DFC and in the proximal part of the DFC than 
in the zones of DFC more distant from FCs. Finally, the levels of fibrillarin appeared to be 
clearly dependent on the transcriptional activity of the nucleolus, and were shown to change 
during the progression of the interphase of the cell cycle (Cerdido and Medina 1995; Medina 
et al. 2000). Recent studies on the fibrillarin gene, characterized for Arabidopsis , have 
demonstrated that its expression is regulated by hormones, e.g. abscisic acid, and it is 
modulated by aging (Pih et al. 2000). 

Nucleolin, a characteristic and prominent nucleolar protein (100 kD) that 
functions as a shuttle protein between nucleus and cytoplasm and is also 
found on the cell surface, was first identified for mammalian cells, where it 
is also called “C23" (Orrick et al. 1973). It appears evolutionarily conserved 
in animals (Lapeyre et al. 1987; Tuteja and Tuteja 1998), yeast (Lee et al. 
1991; Gulli et al. 1995) and plants (Martin et al. 1992; Didier and Klee 1992; 
Bogre et al. 1996; De Career et al. 1997; Tong et al. 1997). Nucleolin was 
initially found to be the target of the cytochemical silver staining of the 
nucleolar organizer (Ag-NOR staining; Lischwe et al. 1979); this is now 
commonly accepted, and it was shown that silver staining occurs at the 
N-terminal domain of the protein (Roussel et al. 1992). 

Like fibrillarin, the structure of nucleolin is modular, containing molecular motifs which 
can also be found in other nucleolar and nuclear proteins. In particular, nucleolin shows a 
tripartite structure, formed by an N-terminal domain, containing a variable number of 
highly charged acidic sequence repeats interspersed with basic segments, a globular central 
domain, containing between two and four RRMs, and a C-terminal domain consisting in a 
GAR region. Variations in the numbers of acidic stretches, as well as in RRMs, are found in 
different taxa (Olson 1991; Tuteja and Tuteja 1998; Ginisty et al. 1999). Sequences in the 
N-terminal domain show homologies to the high-mobility group (HMG) of nonhistone 
chromatin proteins. This domain also contains target sites for phosphorylation by CK2 and 
cdc2 kinase (Caizergues-Ferrer et al. 1987; Belenguer et al. 1990). Homologies of the RRM 
motif and of the GAR region are present, as mentioned for fibrillarin. 



The multipartite structure of nucleolin is reflected in the multiplicity of 
functions, defining it as a “multifunctional” protein. Nucleolin is involved 
in the control of ribosomal RNA transcription (Bouche et al. 1987), in the 
structural organization of rDNA chromatin, including the condensation of 
this chromatin in mitosis (Olson et al. 1983; Erard et al. 1988), in promoting 
secondary structures in pre-rRNA necessary for processing (Bugler et al. 
1987), in preribosome maturation (Herrera and Olson 1986), and in the 
transport of preribosomal particles to the cytoplasm (Borer et al. 1989; 
reviewed by Tuteja and Tuteja 1998). The best-studied mechanism is the 
interaction of nucleolin with pre-rRNA in the transcript processing. Mouse 
nucleolin has been shown to bind with high affinity and specificity to a 
sequence of 1 8 nucleotides in pre-rRNA which adopts a stem-loop structure 
in the ETS, whose cleavage constitutes the first step in pre-rRNA processing 
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(Ghisolfi-Nieto et al. 1996). Nucleolin binding to pre-rRNA is required for 
the processing reaction in vitro, probably by promoting the association of 
other factors such as U3 snoRNP, thus contributing to the formation of the 
processing complex (Ginisty et al. 1998). A similar stem-loop structure has 
also been found in other positions in the sequence of mammalian pre- 
rRNA, which could indicate that the mechanism by which nucleolin acts in 
the first step of processing is repeated in further steps (Serin et al. 1996). On 
the contrary, the interaction of nucleolin with rDNA is still poorly under- 
stood. The affinity of nucleolin for rDNA sequences located in the IGS has 
been reported and, in some cases, these sequences have been identified as 
matrix- attachment regions (Olson et al. 1983; Dickinson and Kohwishige- 
matsu 1995; McGrath et al. 1997). It has been demonstrated that nucleolin 
displaces the chromatin-binding domain of histone HI, thus being capable 
of modifying the basic structure of chromatin for transcription (Erard et al. 
1988), and a further insight in the relationship of nucleolin with the tran- 
scription complex has recently evidenced that increasing amounts of nu- 
cleolin drastically reduces in vitro the number of transcription complexes 
formed by rDNA and RNA pol I (Roger et al. 2002). 

Many of the functions of nucleolin are phosphorylation dependent 
(Ballal et al. 1975; Bourbon et al. 1983). The primary structure of nucleolin, 
in animal and plant cells, contains sequences corresponding to sites of 
phosphorylation by CK2 and cdc2 kinase (Lapeyre et al. 1987; Bogre et al. 
1996; Tong et al. 1997); actually, nucleolin is phosphorylated on serine by 
CK2 during interphase of proliferating cells (Caizergues-Ferrer et al. 1987), 
and on threonine by the cyclin-dependent cdc2 kinase during mitosis 
(Belenguer et al. 1990). 

In plants, the two proteins with homologies to nucleolin identified in onion root mer- 
istematic cells, NopA64 and NopAlOO, are phosphorylated in vitro by CK2 and cdc2 kinase, 
and the two kinases were shown to colocalize with the nucleolin homologues. NopAlOO is 
the most phosphorylated nuclear protein of actively proliferating onion root cells (De Career 
et al. 1997; Medina et al. 2001, and unpublished results). The exact mechanism of the 
functional modulation of nucleolin by phosphorylation is not well understood. Unphospho- 
rylated nucleolin is a potent transcription inhibitor in vitro, and this form is recovered 
associated with chromatin in resting cells (Bourbon et al. 1983; Lapeyre et al. 1987). On the 
other hand, phosphorylated nucleolin is cleaved during transcription of rRNA genes (War- 
rener and Petryshyn 1991). It has been speculated that unphosphorylated nucleolin binds 
to rDNA and represses transcription; phosphorylation induces proteolysis, and conse- 
quently releases nucleolin from DNA, thus activating transcription. The physiological 
proteolysis of nucleolin is also related to the proliferative state of the cell, since the nucleolin 
self-cleaving activity was inhibited by a nucleolar extract prepared from proliferating cells 
(Chen et al. 1991 

The levels of nucleolin have been shown to be highly dependent on cell 
proliferation, and on the phase of the cell cycle (De Career et al. 1997; Sirri 
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et al. 1997). In alfalfa, the nucleolin gene is not expressed in mitotic-arrested 
cells, but, upon mitogenic stimulation and cell cycle reentry, the gene is 
induced very early, exactly at the same time as a marker cyclin gene, 
indicating that nucleolin gene expression and cyclin expression could be 
regulated by the same factors. Otherwise, the expression of nucleolin is 
restricted to proliferating tissues of the plant (Bogre et al. 1996). In pea, the 
expression of nucleolin is regulated by light, following a close correlation 
with the effects of light in increasing cell proliferation and mitotic activity 
(Reichler et al. 2001). In fact, the modulation of the nucleolin function by 
phosphorylation by CK2 and cdc2 kinase is related to cell proliferation, 
since these two kinases are known to phosphorylate a battery of different 
targets as a response to proliferative signals, and as a modulation of the 
different periods of the cell cycle (Pinna 1990; Doree and Galas 1994). 
During cell division, onion nucleolin, as a component of the pre-rRNA 
processing complex, is found first at the periphery of chromosomes, and 
then in prenucleolar bodies, playing a part in nucleologenesis (Medina et 
al. 1995, 2001). Immunolocalization of nucleolin in onion cells, either by a 
heterologous antibody (Martin et al. 1992) or by homologous anti-NopA64 
(Medina et al. 2001), allowed the definition of different functional subdo- 
mains in the structurally homogeneous DFC; the same subdomains could 
also be defined after in situ localization of fibrillarin (Cerdido and Medina 
1995). These are examples showing the contribution of the localization of 
nucleolar proteins to the elucidation of the functional architecture of the 
nucleolus. 



5.2 Structural-Functional Organization of the Nucleolus 

Structural domains in the nucleolus with functional significance could be 
defined by the localization in situ of transcription of rRNA genes (see Busch 
and Rothblum 1983). However, there was and is still an ongoing debate, 
whether transcription of rRNA takes place in nucleolar FCs or in the DFCs 
(Fakan and Puvion 1980; Scheer and Rose 1984; Thiry and Thiry-Blaise 
1989; Wachtler et al. 1989). In situ run-on experiments showed the site of 
transcription at the boundary between FCs and the DFC (Dundr and Raska 
1993; Hozak et al. 1994). For plants, these questions had been summarized 
in an earlier review on the structural and functional organization of the 
nucleolus (Leweke and Hemleben 1982). Here, we will focus on the recent 
state of the problem regarding plant cell nucleoli. 

Quantitative immunolocalization of DNA and RNA pol I in the onion nucleolus showed the 
transition area between FCs and the DFC as a significant nucleolar domain for rDNA 
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transcription (Martin et al. 1989; Medina et al. 1990; Martin and Medina 1991). Further 
studies, using different experimental methods, consistently showed the transcription 
marker in a restricted nucleolar area comprising the outer periphery of FCs and the inner 
periphery of the DFC (Testillano et al. 1994; Shaw et al. 1995; Melcak et al. 1996). The use of 
Br-UTP incorporation to isolated onion nuclei evidenced a focal arrangement of the tran- 
scription sites in the nucleolus; most of these foci were located at the periphery of FCs, but 
the core of every FC appeared devoid of labelling; otherwise, transcription foci did not totally 
surround FCs, so that transcription occurred at discrete points at the boundary domain 
between FCs and the DFC. Colocalization between transcription and fibrillarin, the latter 
used as a marker of early processing, was only partial in transcription foci, since fibrillarin 
was not found in the zones of these foci immediately bordering FCs. Finally, other transcrip- 
tion foci were found deep in the DFC, in zones located between FCs (De Career and Medina 
1999; Medina et al. 2001). The focal organization of nucleolar transcription had also been 
shown in other plant models (Thompson et al. 1997), but the relationship of these foci with 
FCs was questioned. Recently, Br-UTP incorporation has been detected in pea nucleoli using 
pre-embedding labeling and three-dimensional electron microscopy of entire nucleoli 
(Gonzalez-Melendi et al. 2001). Labeling, observed as large irregular silver particles resulting 
from enhancement of nanogold, forms relatively large clusters of particles in panoramic 
views of thin serial sections of the nucleolus (which could correspond to transcription foci 
of the confocal microscope); these large clusters can be resolved in smaller clusters at higher 
magnifications, possibly representing single transcription units (“Christmas trees”). The 
structure underlying the clusters is supposed to be the DFC, but no indication is given as to 
where FCs could be situated. According to the authors’ interpretation, actively transcribed 
rDNA would be an extended loop, emanating from the condensed rDNA chromatin adjacent 
to the nucleolus and continuously running through the DFC (Gonzalez-Melendi et al. 2001). 

Earlier work had shown the conversion of the heterogeneous FCs, undoubtedly contain- 
ing condensed rDNA chromatin, into homogeneous FCs, the small type of FCs present in 
active plant nucleoli containing decondensed chromatin, in the course of germination 
(Deltour 1985; Risueno and Medina 1986) as well as during postmeiotic nucleolar reactiva- 
tion in young microspores (Esponda and Gimenez-Martm 1974; Medina et al. 1983). The 
existence of a nucleolar matrix or nucleolar proteinaceous skeleton with the function, among 
others, of gathering rDNA in FCs and structurally organizing it depending on the nucleolar 
activity and the requirements of the cells (Moreno Dfaz de la Espina 1995) is now supported 
by the fact that MAR or S AR elements are present in the AT -rich region of every rDNA repeat 
(see above). 

Therefore, it can be summarized that rRNA transcription complexes are 
assembled within FCs, but actual transcription only occurs at certain dis- 
crete points at their periphery in the DFC (Fig. 2). Furthermore, transcrip- 
tionally active chromatin can also be found deep in the DFC, in the stretches 
of rDNA chromatin connecting contiguous FCs. Certainly, some additional 
work is still needed for a full understanding of the molecular architecture 
of the transcription of rRNA genes in the nucleolus (see Huang 2002). 



6 Ribosome Biogenesis 

Ribosome biogenesis is a complex process which takes mainly place in the 
nucleolus starting with the transcription of the rRNA genes by pol I and III 
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Fig. 2. Schematic presentation of the organization of 35S rDNA and transcription of rRNA 
genes in the structural components of the nucleolus, according to available experimental 
data. The drawing at the top represents the typical structure of the plant cell nucleolus, as 
observed in an ultrathin section under the electron microscope. The dense fibrillar compo- 
nent (DFC) is massive and appears surrounded by the granular component (GC). Lighter 
small areas inside the DFC are fibrillar centers ( FC ). The nucleolus is organized around 
rDNA, in such a way that the fiber of DNA containing rRNA genes runs across the nucleolus 
following a meandering pathway, which previously received the name of “nucleolar organ- 
izer track” by Godward (1950). This track passes through FCs and is embedded in the DFC. 
The bottom drawing represents the area inside the square , ideally stretched and magnified. 
Fibrillar centers represent foci of accumulation of rDNA which is ready for transcription, 
with bound pol I and factors (except for condensed chromatin inclusions of heterogeneous 
FCs), but actual transcription only occurs in loops emerging from the concentration of 
rDNA. The organization of these rDNA foci is probably assured by structural proteins of the 
nucleolar matrix. As a result of transcription, pre-rRNA is formed, which is located in the 
DFC. Therefore, the site of transcription is the transition area FC-DFC, also called the 
FC-DFC boundary. The organization of transcription in loops causes that the entire bound- 
ary of FCs is not occupied by transcription units, or “Christmas trees” ( TU ), which are only 
detected at certain places. These TUs also appear in the rDNA connecting contiguous FCs, 
which structurally corresponds to the bulk of DFC 



transcribing the small 5S rRNA genes outside of the nucleolus, whereas 
RNA polymerase I generates a precursor molecule directly in the nucleolus 
which is rapidly processed by endonucleases and exonucleases into the 25S, 
5.8S and 18S rRNAs and modified by pseudouridylation and methylation 
(Venema and Tollervey 1999). During these processing reactions, a large 
number of non-ribosomal proteins associate with the pre-RNAs (see above) 
in parallel with the starting assembly of the approximately 80 ribosomal 
proteins onto the RNA (Warner 1989, 1999, 2001). For the animal system, 
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the view of the pathway of ribosome biogenesis is becoming more and more 
detailed (see Fig. 3). Early analyses in the 1970s already reported that the 
45S pre-rRNA is initially assembled with ribosomal proteins to form a 90S 
particle (Trapman et al. 1975; Kruiswijk et al. 1978), which - as now known 
- also includes many non-ribosomal proteins (Dragon et al. 2002; Grandi 
et al. 2002). Subsequently, the pre-RNA undergoes three rapid cleaving 
reactions leading to the separation of the 90S particle into the large and the 
small subunits (Trapman et al. 1975, Venema and Tollervey 1999). Analyses 
of the large pre-60S particle revealed that they show dynamic changes 
during maturation and that a whole family of pre-60S particles exists 
carrying many different accessory proteins (Milkereit et al. 2001; Nissan et 
al. 2002). During maturation, the pre-60S particle is first released into the 
nucleoplasm and then exported to the cytoplasm via the Xpo 1/Crml /ex- 
portin' 1 export pathway with the help of Nmd3 and Mtr2 and most likely 
other pre-60S associated proteins (Ho et al. 2000; Gadal et al. 2001). The 
export mechanism of the 40S subunit is still unclear, but it seems that it 
undergoes a comparatively simpler maturation (Moy and Silver 1999; 
Venema and Tollervey 1999). A model for the pathway of the 60S pre-ribo- 
some maturation and export of 40S and 60S particles is presented in Fig. 3. 

The complexity of ribosome biogenesis gives rise to many steps and 
opportunities where regulation could take place (reviewed by Leary and 
Huang 2001). 



7 Ribosome-Inactivating Proteins (RIPs) 

Ribosome-inactivating proteins (RIPs) consist of toxins that catalytically 
inactivate ribosomes at a universally conserved region of the large ribo- 
somal RNA, the oc-sarcin/ricin loop (S/R loop). Previously, it was found that 
a fungal toxin, a-sarcin, from Aspergillus giganteus, inactivates ribosomes 
by hydrolyzing a single phosphodiester bond on the 3’ side of G4325 in the 
cc-sarcin/ricin (S/R) loop of rat 28S ribosomal RNA or its equivalent residue 
G2661 in E. coli 23S ribosomal RNA (Endo and Wool 1982). Ribosome-in- 
activating proteins are also found in plants as toxic compounds that are 
remarkably potent inactivators of the eukaryotic protein synthesis system 
(for reviews, see Stirpe and Barbieri 1986; Stirpe et al. 1992; Parente et al. 
1993). In contrast to fungal and bacterial translation inhibitors, RIPs are 
proposed to play a defence role in plants. RIPs belong to the group of RNA 
N-glycosidases, because they carry out a single N-glycosidation event at a 
specific and conserved A residue of the 25/28S rRNA of the eukaryotic 60S 
ribosome subunit that alters the binding site of the translational elonga- 
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tional factor eEFlA. Therewith they prevent protein synthesis leading to 
subsequent cell death. 

Different forms of RIPs have been identified in plants. Most commonly 
RIPs are single-chain proteins (type-1 RIPs) with a molecular weight of 
25-30 kDa. Type 2 RIPs consist of a toxin-part chain (A-chain), and a 
lectin-part chain (B-chain), bound by a disulfide bond, having molecular 
weights of 60-65 kDa and approximately 120 kDa, respectively, that binds 
to cell surfaces. The type 2 RIPs are potent toxins, the best known of which 
is ricin from Ricinus communis. Type 1 RIPs and the A-chains of type 2 RIPs 
possess sequence and structural homology. For a type 2 RIP, the B-chain of 
the molecule first binds to cell surface with its lectin facility, then the 
A-chain enters the cell through endocytosis to kill the cell by inactivating 
ribosomes. Such biomolecules causing cytotoxicity are being exploited for 
designing immunotoxins/hormonotoxins using heterobifunctional conju- 
gates. These carriers conjugated with the RIPs can influence cellular traf- 
ficking and inhibit protein synthesis. It is assumed that these proteins from 
plants can be utilized for various therapeutical treatments including cancer, 
AIDS and other viral diseases of present times (Stirpe et al. 1992). RIPS have 
been tested extensively for use as plant defence molecules, particularly 
against fungal diseases (Logemann et al. 1992; Stirpe et al. 1992; Nielsen et 
al. 2001). Ribosome-inactivating proteins are found in different parts of 
plants, in concentrations ranging from a few micrograms to several hun- 
dred milligrams per 100 g of plant tissues (Singh and Singh 2000). 

In particular, the abundant RIP I from maize ( Zea mays) kernels was tested for antifungal 
activity (Nielsen et al. 2001). The substrate specificity of the maize RIP I differs across diverse 
taxa (Krawetz and Boston 2000). The maize RIP I was found to be produced as a zymogen, 
proRIP I. ProRIP I accumulates during seed development and becomes active during 
germination when cellular proteases remove acidic residues from a central domain and both 
termini. A wide spectrum of RIPs was isolated from different cucurbits (family Cucurbi- 
taceae). The liposome-mediated delivery of the type 1 RIP luffin to human melanoma cells 
in vitro was described (Poma et al. 1999). The exposure of melanoma cells to two types of 
liposomes resulted in the inhibition of protein synthesis and cell growth. A new RIP 
(5-momorcharin) and a candidate RIP (e-momorcharin) were isolated from the seeds and 
fruits of the bitter gourd Momordica charantia (Cucurbitaceae; Paul et al. 1999). Recent 
studies have shown that some RIPs possess strong anti-human immunodeficiency virus 
(HIV) activity, and several common plant RIPs including agrostin, gelonin, luffin, lant RIPs 
including agrostin, gelonin, luffin, ss strong anti-human immunodeficiency virus (HIV) 
activity, and several common pype 1 RIP luffin to human melanoma cells in vitro was 
described (Poma et al. 1999)ed RIPs, MAP30 and GAP 31, isolated from Momordica charan- 
tica and Gelonium multiflorum , respectively, inhibited the replication of human immunode- 
ficency virus type 1 (HIV-1) by inhibiting the viral integrase activity (Bourinbaiar and 
Lee-Huang 1996), indicating a different mechanism of these compounds. 

Pokeweed (Phytolacca americana) antiviral protein (PAP), another type 1 RIP, was 
purified from Agrobacterium rhizogenes- transformed hairy roots of Phytolacca americana 
(Park et al. 2002a, b) from which it was constitutively secreted as part of the root exudates. 
Obviously, PAP-R depurinates fungal ribosomes only in vitro, suggesting a synergistic 
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mechanism that enables PAP-R to penetrate fungal cells. PAP could also inhibit translation 
of mRNAs and viral RNAs that are capped by binding to the cap structure and depurinating 
the RNAs downstream of the cap (Parikh et al. 2002). Possibly, this single-chain RIP targets 
not only the large rRNA, but also its own mRNA, supporting the idea of a multifunctional 
protein. 

The type 2 RIPs (RIP II), firstly represented by ricin, the toxic RIP of Ricinus 
communis , recently have attracted enormous interest (Di Cola et al. 2001). 
Another type 2 RIP, the mistletoe lectin I (ML I) from Viscum album , which 
consists also of two chains, is even more toxic than ricin: The A-chain fulfills 
the proper RIP function of an RNA N-glycosidase which acts at the specific 
A residue of the large rRNA and inhibits translation at the ribosome; the 
B-chain has lectin activity interacting with specific sugar residues and 
mediating the transport of the dimeric complex through the plasma mem- 
brane. Since it is proposed that this ML I will be used therapeutically in 
cancer therapy, this protein has been studied in detail. Also, cinnamomin 
was characterized as a new type 2 RIP (Xie et al. 2001; Liu et al. 2002). It is 
expressed specifically in cotyledons and accumulates in large amounts as a 
storage protein at later stages of seed development. 

Thus RIPs comprise plant compounds which may be naturally used in 
plant defence. However, a clear biological significance of RIPs in plant 
physiology remains largely elusive. Nevertheless, they attract enormous 
interest as antiviral and possibly as anti-tumor therapeutically active sub- 
stances. 



8 Conclusions and Perspectives 

Ribosome biogenesis and function of ribosomes are supposed to represent 
basic functions of the cell, and the components were often grouped into 
house-keeping functions which are constitutively provided. Now, it be- 
comes more and more clear that expression is highly regulated on various 
levels and by multiple exogenous (light, different stress conditions etc.) and 
endogenous cellular stimuli which influence cell activity and proliferation 
status of the cells, like phytohormones and others. Similarly, the genes 
coding for rRNA and for nucleolar and r-proteins are expected to be highly 
regulated and the gene products to be delivered corresponding to the cell 
stage. Thus, the idea of the “plurifunctional nucleolus” now stimulates 
further research, and it reminds us that the nucleolus is a highly dynamic 
nuclear structure. Extensive investigations on the structural organization 
and molecular evolution of the rRNA genes are now available and can be 
applied for various aspects, such as following speciation processes and 
tracing phylogenetic relationships. In contrast, regulation of transcription 
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of the ribosomal genes is still poorly understood and needs further research 
approaches in plants. Interesting phenomena, like nucleolar dominance or 
gene silencing, can be observed depending on the phylogenetic relationship 
of hybrid partners. Processing of pre-rRNA has been mainly studied for 
yeast, but similar snoRNPs may be involved in plant pre-rRNA processing 
which is supposed to be also a regulating step. Proteomic studies in animals 
and yeast slowly offer an overview on the functional nucleolar proteins. For 
the plant cell nucleolus, respective proteomic analyses have not yet conse- 
quently been performed and can be expected only in the future. In addition, 
plants provide various ribosome inactivating proteins which gain increas- 
ing importance as therapeutically useful substances. 
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Genetics of Phytopathology: 

Fungal Morphogenesis and Plant Infection 

Stefan G.R. Wirsel, Sven Reimann, and Holger B. Deising 



1 Introduction 

Many fungi have evolved distinct cellular structures to facilitate the inva- 
sion of their host plants. These infection structures exhibit a range of 
morphological and physiological specialisations (Staples and Hoch 1987; 
Mendgen and Deising 1993; Deising et al. 1996; Perfect and Green 2001). 
The fact that fungi belonging to different taxa have developed a highly 
specialised cell type, called appressorium, to enter their host and to gain 
access to the nutrient sources inside plant cells or within the apoplast 
(Mendgen and Deising 1993) may indicate the existence of a “force” that 
drives evolution of fungal infection structures. In this respect it might be 
interesting to note that extant arbuscular mycorrhizas can also penetrate 
roots of their host by appressoria. 

Parasitic relationships between fungi and plants had already developed in the Lower 
Devonian, as indicated by recent fossil findings in the Rhynie chert in Scotland. Taylor and 
coworkers (Taylor et al. 1999) discovered hyphae of a pathogenic ascomycete extending 
through the cuticle of the early land plant Asteroxylon. Successful invasion of and propaga- 
tion on plants seen in these samples indicates that pathogenic relationships between fungi 
and the early land plants existed 400 million years ago (Taylor et al. 1999), and co-evolution 
of some plant pathogenic fungi and their host plants has thus taken place since (Bakkeren 
et al. 2000; Mugnier 2002). Recent molecular phylogenetic studies based on multiple protein 
analyses have indicated that interactions between plants and associated fungi might have 
been established much earlier (Heckman et al. 2001). These data suggested that land plants 
appeared around 700 million years ago and that the major lineages of fungi divided between 
1.5 billion and 970 million years ago. It was hypothesised that colonisation of the land by 
plants was supported by symbiotic fungi, probably by glomalean type mycorrhizas (Pirozyn- 
ski 1976; Selosse and Le Tacon 1998; Blackwell 2000). 

Frank (1883) first coined the term ‘appressorium’ (= adhesion organ) to describe the 
apical swellings of germ tubes observed with Fusicladium sp. and Gloeosporium ( Colie - 
totrichum) lindemuthianum infecting poplar ( Populus tremula ) and bean ( Phaseolus vul- 
garis ), respectively. Emmett and Parbery (1975) extended this definition to “all structures 
adhering to host surfaces to achieve penetration, regardless of morphology”. 

Appressoria may be visible as inconspicuous swellings of the germ tube as 
in Ustilago maydis (Snetselaar and Mims 1993) or in basidiosporelings of 
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rust fungi (Gold and Mendgen 1991), or discrete swollen, lobed or dome- 
shaped cells, separated from the germ tube by a septum as in rust uredinio- 
and aeciosporelings, in Magnaporthe grisea and Colletotrichum species, 
and in many other plant pathogens (Hahn et al. 1997a). Several reports 
indicate that infection structure differentiation is regulated by an ordered 
expression of a multitude of infection structure-related genes. In rust 
urediniosporelings, for example, single surface signals are capable of induc- 
ing a sequence of morphological responses leading to infection structure 
differentiation, and synthesis of various cell wall-degrading enzymes (Hoch 
et al. 1987; Mendgen et al. 1996; Read et al. 1997). Likewise, perception of 
single chemical signals present on or hydrophobicity or hardness of the 
plant surface have been described to trigger the expression of an array of 
genes required for appressorium formation (Podila et al. 1993; Lee and 
Dean 1994; Kolattukudy et al. 1995; Gilbert et al. 1996; Dean 1997; Kim et 
al. 2002, and references therein). 

During the last decade, significant progress in understanding mecha- 
nisms and functions of fungal infection structures has been made. This has 
been possible mainly by modern cell and molecular biological approaches, 
leading to the identification of genes essential for infection-related mor- 
phogenesis and virulence or pathogenicity. 

This review will describe the events beginning with the germination of 
fungal spores on the plant surface leading to its penetration and estab- 
lishment of the pathogen within the host. We will draw attention to the link 
between the form and the function of fungal infection structures where 
appropriate, and will provide molecular data underlining more “classical” 
genetic and physiological results. Emphasis will be given to demonstrate 
that different fungi use different strategies to gain access to nutrients from 
the interior of plant cells or from apoplastic compartments, which is a 
requirement for successful colonisation and reproduction. We will also 
discuss that a detailed knowledge of morphogenic differentiation exhibited 
by fungal plant pathogens during various stages of infection might be 
helpful to understand the mode of action of fungicides currently in use. This 
might also be useful for rational drug design and screening procedures. 



2 Infection Structures and Modes of Entry 

The mode of entry depends on the infection structures differentiated. Fungi 
that form distinct and melanised infection structures are able to generate 
significant turgor pressure which is translated into force. Examples of fungi 
that differentiate melanised appressoria are Magnaporthe grisea , different 
Colletotrichum spp. and Gaeumannomyces graminis , the latter forming 
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hyphopodia. In forceful invasion cell wall-degrading enzymes (CWDEs) are 
thought to play a minor role, although the exact impact of CWDEs needs to 
be analysed carefully in each fungal system. In contrast, fungi that do not 
form elaborate appressoria, e.g. Botrytis cinerea, Claviceps purpurea, or 
different Cochliobolus or Fusarium spp., are likely to depend on CWDEs for 
initial invasion. In order to mechanistically understand the different modes 
of infection, we will describe in detail force- and enzyme-dependent pene- 
tration of the host epidermis. 



2.1 T urgor-Mediated Breaching of the Plant Cell W all 

Although the morphology of appressoria in different species varies signifi- 
cantly (Howard 1997), it has since long been accepted that these cells 
mediate penetration of plant cuticle and epidermal cell wall (Frank 1883; 
Miyoshi 1895). Irrespective of whether fungi use enzymes or force, or a 
combination of both to penetrate, they need to adhere tightly to the plant 
surface. 

A clear indication of the strength of adhesion is provided by the fact that 
the appressorial base remained attached to the plant cuticle after sonica- 
tion, which completely removed the upper part of the cell (Howard 1997). 
Adhesion is necessary to counteract the force exerted onto the underlying 
surface by the penetration peg arising from the appressorial base. A con- 
vincing illustration of appressorial adhesion is provided by studies employ- 
ing optical waveguides (Bechinger et al. 1999; Bastmeyer et al. 2002). 

Optical waveguides consist of a transparent high viscosity polydimethylsiloxane (PDMS) 
layer of 1-pm thickness, sandwiched between two thin films of aluminium. Light propaga- 
tion in planar waveguides allows optical imaging and thus direct determination of vertical 
forces exerted by penetration pegs, and also forces needed to counteract these. While the 
penetration peg exerts force onto the waveguide, a ring-shaped area of the appressorial base 
surrounding the penetration peg lifts the elastic waveguide, indicating adhesion to counter- 
act the invasive force. This technique allowed Bechinger and co-workers (1999) to directly 
measure forces of up to 25 pN (mean value 16.8 pN) exerted by single melanised appressoria 
of C. graminicola. For comparison, if a force of 17 pN pm" 2 were exerted over the palm of a 
hand, a human could lift an 8,000-kg school bus (Money 1999). Howard and coworkers 
(1991b) demonstrated that M. grisea is able to forcefully penetrate hard synthetic surfaces 
such as Mylar membranes. In order to determine the turgor pressure of appressoria, these 
infection cells were incubated in solutions with increasing osmotic pressure, and the point 
of incipient cytorrhysis was taken as a measure of intracellular osmotic potential. These 
indirect turgor measurements showed that the rice blast fungus can generate pressures of 
up to 8 MPa (80 bar) within its appressorium (Howard et al. 1991b). Already in 1895, Miyoshi 
took a simple experimental approach, observing puncturing of plant cells by needles of 
known diameter and weight, and estimated that the pressure needed to breach the epidermis 
of Allium cepae and Tradescantiaprocumbens corresponds to 0.35 and 0.5 MPa, respectively 
(Miyoshi 1895). The pressure in melanised appressoria of M. grisea and C. graminicola 
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corresponds to values between 5 and 8 MPa (Howard et al. 1991b; Bechinger et al. 1999), 
suggesting that these fungi could enter plant cells without enzymatic assistance. 

The mechanism of turgor generation has been discussed for a long time, 
and in the 1980s it was recognised that melanin plays an important role in 
the penetration process. Kubo and coworkers have recently identified novel 
transcriptional activators of Magnaporthe grisea and Colletotrichum 
lagenarium , containing Cys2His2 zinc finger and Zn(II)2Cys6 binuclear 
cluster DNA-binding motifs, that regulate transcription of the melanin 
biosynthesis genes polyketide synthase, trihydroxynaphthalene reductase, 
and scytalone dehydratase in a development-specific manner (Kubo et al. 
2000; Tsuji et al. 2000). In most fungal systems, melanins are thought to 
protect the mycelium against environmental stresses such as UV radiation, 
oxidants, antifungal drugs and macrophages formed by man and other 
animals in response to fungal infection (Henson et al. 1999; Nosanchuk et 
al. 1999). In addition, melanin acts as a scavenger of free radicals, a property 
essential to pathogenicity of the facultative human pathogen Cryptococcus 
neoformans (Nosanchuk et al. 1999). In the black yeast Wangiella dermati- 
tidiSy a polymorphic pathogen of man, melanin is incorporated into the 
outer cell wall and appears to mediate biomechanical properties and to 
support invasive hyphal growth into solid substrata (Brush and Money 
1999). 

Melanin-deficient mutants of Colletotrichum spp. and M. grisea as well 
as wild-type isolates treated with inhibitors of melanin biosynthesis were 
unable to penetrate their host (Kubo and Furusawa 1991; Henson et al. 1999, 
and references therein). In M. grisea and in C. graminicola melanin was 
shown to reduce the pore size in the appressorial wall to less than 1 nm 
(Howard et al. 1991b; J.A. Sugui and H.B. Deising, unpubl. data). As a 
consequence, water, but no larger molecules, can pass this layer, leading to 
water influx and turgor generation when osmotically active solutes are 
accumulated in the appressorium (Fig. 1). Talbot and coworkers showed 
that glycerol accumulates to concentrations of at least 3 M in wild-type 
appressoria of M. grisea (De Jongetal. 1997). Such high concentrations were 
neither present in appressoria of a melanin-deficient mutant nor in appres- 
soria treated with the melanin biosynthesis inhibitor, tricyclazol. De Jong 
et al. (1997) also demonstrated that non-melanised appressorial walls are 
permeable to glycerol, whereas those of melanised appressoria are not. The 
turgor resulting from this glycerol concentration was calculated to be at 
least 5.8 MPa, which is close to that estimated from indirect turgor meas- 
urements (Howard et al. 1991b). Interestingly, glycerol accumulation in 
appressoria is regulated by a different signal transduction pathway than the 
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Fig. 1. Biosynthesis, localisation and function of appressorial melanin. A Melanin is synthe- 
sised from acetyl-CoA by polyketide synthase (PKS) and two subsequent reductase- (medi- 
ated by tetra- and trihydroxynaphthalene reductase, THR) and dehydratase reactions 
(mediated by scytalone dehydratase, SCD). THR is inhibited by tricyclazole fungicides, SCD 
by carpropamid. B The melanin layer ( arrows ) is localised in the appressorial cell wall, in 
close vicinity to the plasma membrane. C By incorporation of melanin the pore size of the 
appressorial wall is reduced to less than 1 nm, so that larger molecules such as osmotically 
active compounds, e.g. glycerol, cannot migrate through plasma membrane and cell wall. 
In contrast, water is taken up, resulting in generation of high turgor pressure, which is 
translated into force at the appressorial base. B and C are from Howard et al. (1991a) 






152 



Genetics 



cellular response to hyperosmotic stress, which is compensated by accumu- 
lation of arabitol (Dixon et al. 1999). 

Appressoria of C. graminicola formed in the presence of the chitin 
synthase inhibitor Nikkomycin Z, incorporate melanin into their appres- 
sorial walls, but these walls are not rigid enough to withstand the developing 
turgor pressure of 5 to 8 MPa and burst (S. Werner and H.B. Deising, 
unpubl. results). Thus, chitin and probably also the other prominent cell 
wall polymer (3-1,3 glucan mediate appressorium rigidity, whereas melanin 
serves as the selectively permeable wall layer. As indicated by dense label- 
ling of appressorial walls of M. grisea by WGA gold conjugates (Howard et 
al. 1991a), chitin is a prominent structural component of these cells. Similar 
results have been obtained for C. graminicola (U. Rauchhaus, G. Hause and 
H.B. Deising, unpubl. data). 



2.2 The Role of Cell Wall-Degrading Enzymes in Fungi 
with Elaborate Appressoria 

In different Colletotrichum species, CWDEs have received considerable 
attention, with emphasis on pectic enzymes. Enzyme activities have been 
primarily been detected in necrotrophic growth phases, e.g. in C. gramini- 
cola- infected maize pith or in bean leaves infected with C. lindemuthianum 
(Nicholson et al. 1976; Wijesundera et al. 1989). Polygalacturonase activity 
was not detected in the latter interaction. Likewise, in the interaction 
between G gloeosporioides f.sp. malvae and its host, Malva pusilla only 
pectic lyase activity was detected at the end of the biotrophic phase and it 
further increased in the necrotrophic phase of infection. A comparison of 
expression of pel-1 and pel-2 genes in culture and in planta with other 
pectinase genes of C. gloeosporioides f. sp. malvae and other plant patho- 
genic fungi suggests that during necrotrophic infection genes that are not 
under catabolite control are predominantly expressed (Shih et al. 2000). 
Lack of polygalacturonase activity may be due to the presence of protei- 
naceous inhibitors of these enzymes, which have been found in many plants 
(De Lorenzo et al. 2001; De Lorenzo and Ferrari 2002). Two endo- polygalac- 
turonase genes ( clpgl , clpg2 ) of G lindemuthianum have been cloned 
(Centis et al. 1997). Experiments with the GFP reporter gene fused to the 
promoter of clpg2 indicated that in spite of the high turgor generated by 
melanised appressoria, enzymes may assist the penetration process (Du- 
mas et al. 1999). A similar conclusion can be drawn from a different line of 
evidence. Inactivation of the Snfl homologue (see below) of G graminicola 
resulted in reduction of extracellular activities of CWDEs, reduced rates of 
penetration and reduced virulence towards maize leaves (M. Wernitz and 
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H.B. Deising, unpubl. data). Analyses of the C. lindemuthianum clpg2 
promoter identified a cis-acting sequence with similarity to a yeast filamen- 
tation and invasion response element, suggesting that invasive growth and 
depolymerising activity are co-ordinatedly regulated (Herbert et al. 2002). 
The Kssl MAPK cascade of S. cerevisiae controls both dimorphic develop- 
ment (formation of pseudohyphae and invasive growth) and a depoly- 
merase (polygalacturonase) (Madhani et al. 1999). Also, in the mammalian 
pathogen Candida albicans, the same transcription factor ( CaTECl ) con- 
trols hyphal development and expression of extracellular proteases 
(Schweizer et al. 2000). 

However, in addition to hydrolytic enzymes, pectate or pectin lyases that 
cleave a- 1 ,4-bonds by introducing a A-4,5-unsaturated bond, have received 
some attention. Antibodies raised against pectate lyase of C. gloeosporioides 
had no effect on spore germination, germ tube elongation or appressoria 
formation, but when conidia were mixed with the antibodies prior to 
inoculation, symptom development was inhibited on avocado, mango and 
banana fruits (Wattad et al. 1997). Targeted C. gloeosporioides mutants 
deficient in the pectate lyase ( pelB ) gene exhibited 25% lower pectate lyase 
(PL) and pectin lyase (PNL) activities and 15% higher polygalacturonase 
(PG) activity than the wild type. When pelB mutants were inoculated onto 
avocado fruits, a reduction in decay diameter by approximately 40% was 
observed compared with the wild type and undisrupted transformant con- 
trols, supporting the role of pelB as a virulence factor in C. gloeosporioides 
(Yakoby et al. 2001). Accordingly, when the C. gloeosporioides pelB gene 
was transformed into and expressed in C. magna, a pathogen of cucurbits 
that causes minor symptoms on watermelon seedlings and avocado fruits, 
the transformants showed increased pectate lyase activity and increased 
maceration capabilities on avocado pericarp, and more severe maceration 
and damping off developed on watermelon seedlings (Yakoby et al. 2000). 

Wu et al. (1997) demonstrated that deletion of two e«do-|3-l,4-xylanase 
genes of M. grisea led to synthesis and secretion of additional hitherto 
unknown isozymes by the rice blast fungus. This again may be taken as 
evidence for the important role of these enzymes in host penetration and/or 
spread in the tissue. As this fungus is known to penetrate forcefully, little 
attention has so far been paid to the contribution of CWDEs for penetration 
in this system. 

The wealth of data available indicates that it is very likely that both, 
enzymes and force are needed for initial penetration, the proportions being 
characteristic for each individual pathogen. This had been demonstrated 
for the infection process of the powdery mildew fungus of barley, Blumeria 
graminis f. sp. hordei. Although the appressoria of this economically im- 
portant pathogen are not melanised, they develop a maximum turgor 
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pressure of approx. 2-4 MPa, and the resulting force may act in concert with 
CWDEs such as cellulases, which have been shown to be present at the 
penetration site (Pryce-Jones et al. 1999). Irrespective of the mechanism of 
penetration, spread within the tissue will require the action of CWDEs in 
all invasively growing filamentous fungi. Also in other fungi with well-de- 
veloped appressoria, for instance the rust fungus Uromyces fabae, CWDEs 
are thought to contribute to the infection process (Fig. 2) (Mendgen and 
Deising 1993; Mendgen et al. 1996). 



2.3 The Role of Cell Wall-Degrading Enzymes in Fungi 
Without Elaborate Appressoria 

CWDEs may not only decrease plant cell wall strength to facilitate inter- 
and intracellular growth and wall penetration, but also serve saprophytic 
functions, i.e. provide the fungus with nutrients. Targeted inactivation of 
genes encoding CWDEs would allow one to address the role(s) of these 
enzymes in pathogenesis directly, but their redundancy prevented to obtain 
clear results with this approach. In the polyphageous fungus Botrytis dri- 
er ea> causing disease in at least 235 plant species, 14 polygalacturonase 
isoforms (and 6 polygalacturonase-encoding genes) have been demon- 
strated in addition to other pectic enzymes (ten Have et al. 2002). Expres- 
sion data have been presented for the polygalacturonase gene family of B . 
cinerea. The most prominent transcripts originate from the genes Bcpgl 
and Bcpg2 . These genes do not appear to be under catabolite repression, 
whereas Bcpg4 is currently the only one that is. The final degradation 
product, galacturonic acid, induces Bcpg4 and Bcpg6 , whereas Bcpg3 is 
induced by low ambient pH and is not much affected by the carbon source 
present in the growth medium (Wubben et al. 2000; ten Have et al. 2002). 
As the genes differ in their expression patterns they may allow the fungus 
to quickly adapt to changes in the nutrient basis, and help to adapt to the 
different tissues of the various host plants. One of the most important 
findings was that Bcpgl knock-out mutants show reduced virulence on a 
number of hosts, including tomato leaves and fruits, apple fruits, and broad 
bean and Arabidopsis thaliana leaves (ten Have et al. 1998, 2002). 

Endo- polygalacturonase genes of two closely related Alternaria species, 
i.e. Alternaria citri and Alternaria alternata rough lemon pathotype, ex- 
hibit 99% nucleotide sequence homology. Inactivation of the gene in A. citri 
led to significant reduction of virulence, whereas knock-out of the A. 
alternata gene had no effect (Isshiki et al. 2001). However, as A. citri seems 
to lack toxins, CWDEs may play a more important role in the infection 
process of this fungus, and this result may indicate that CWDEs represent 
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basal virulence factors that act in addition to other known or hitherto 
unknown factors (ten Have et al. 2002). In other systems pectate lyases have 
been identified as virulence factors. The Nectria haematococca ( Fusarium 
solani f. sp. pisi) genome contains a family of at least four pectate lyase 
genes, in addition to other genes encoding depolymerases such as poly- 
galacturonases. Aggressive and non-aggressive isolates show minor differ- 
ences in polygalacturonase activity in vitro, but aggressive isolates exhibit 
much higher pectate lyase activities, suggesting that these enzymes may 
determine virulence (Kolattukudy and Crawford 1987). While the pectate 
lyase gene pelA is induced when the fungus grows on pectate, pelD is only 
induced during inplanta growth and not inducible in vitro. Inactivation of 
either gene alone does not interfere with virulence, but pelA I pelD double 
mutants show drastic virulence losses (Rogers et al. 2000). 

In the ergot fungus Claviceps purpurea, not only polygalacturonase, but 
also xylanase is a virulence factor. This fungus attacks mainly rye, but 
several other grasses can also be infected. Infection by this fungus is organ- 
specific. It exclusively attacks young ovaries and replaces them with its own 
sclerotia. As only limited host cell death occurs, C. purpurea can be regarded 
as a biotroph (Tudzynski et al. 1995). Two e/tdo-polygalacturonase genes, 
cppgl and cppg2 , of this fungus have been cloned and characterised. They 
are closely linked in a head-to-tail arrangement and show a high degree of 
homology, suggesting that a recent gene duplication event has occurred 
(Tenberge et al. 1996). Mutants deleted for both genes were almost non- 
pathogenic on rye, and complementation of the mutants with wild-type 
copies of both genes fully restored virulence, proving that endo- polygalac- 
turonase activity is essential to pathogenic development of the ergot fungus 
(Oeser et al. 2002). In addition to endo-polygalacturonase, xylanase has 
been shown to be essential for virulence of C. purpurea. In general, this can 
be anticipated for grass pathogens, as cell walls of the Poales contain high 
amounts of glucurono-arabino-xylans (Carpita and Gibeaut 1993). Two 
xylanase genes, cpxyll and cpxyl2 , were cloned from C. purpurea. While 
mutants deficient in cpxyll showed wild-type virulence, cpxyl2 mutants and 
cpxyll / cpxyll double mutants were drastically reduced in virulence (Gies- 
bert et al. 1998; ten Have et al. 2002). 

The maize pathogen Cochliobolus carbonum is currently the most inten- 
sively studied system with respect to CWDEs. This fungus does not differ- 
entiate distinct elaborate appressoria, but penetrates the host from hyphal 
swellings (Walton et al. 1995). Walton and coworkers have cloned and 
characterised several genes encoding CWDEs, including xylanase, cellulase, 
protease, polygalacturonase, P-l,3-glucanase and mixed-linked gluca- 
nases. Transformants with single to quadruple mutations do not show 
reduced virulence. However, as plant cell walls represent complex polymer 
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networks (Carpita and Gibeaut 1993), simultaneous reduction in several 
different CWDE activities might be required to decrease the efficiency of 
invasion and spread within the plant tissue. In an elegant approach to 
overcome the problem of gene redundance, Tonukari et al. (2000) cloned 
and inactivated the ortholog of yeast SNF1 (sucrose non-fermenting) gene. 
The SNF1 protein kinase phosphorylates the repressor of catabolite-re- 
pressed genes, MIGlp (the ortholog of which is CreAp in filamentous 
fungi), leading to de-repression of such genes. It is thought that many genes 
encoding CWDEs are subject to catabolite repression in filamentous fungi, 
and, indeed, growth of C. carbonum ccsnfl mutants was reduced by 50 to 
95% on complex carbon sources such as xylan, pectin, or purified maize cell 
walls, correlating well with significantly reduced transcript levels and ac- 
tivities of CWDEs. In the ccsnfl deletion mutant, production of HC toxin 
(see paragraph 3.2) conidiation, conidial morphology and germination, in 
vitro appressorium formation and growth on glucose, fructose, or sucrose 
was normal. The mutant formed fewer spreading lesions and thus was less 
virulent on susceptible maize. The results suggest that ccSNFl - and 
CWDEs - are required in pathogenesis by C. carbonum. However, as growth 
on galactose, galacturonic acid, maltose, xylose or arabinose was reduced, 
one may question the specificity of the approach, as basic metabolism is 
also affected by the mutation (Tonukari et al. 2000; ten Have et al. 2002). 
Several genes encoding CWDEs are not catabolite-controlled, so that this 
approach may not be useful in all plant pathogenic fungi. 

Taken together, recent research has presented evidence that CWDEs 
contribute to the infection process in several plant pathogenic fungi. It can 
be anticipated that CWDEs are needed for initial penetration - particularly 
in those fungi that do not form elaborate appressoria - and for spreading 
within the host tissue. 



3 Colonisational and Nutritional Strategies 

Some taxa of plant-associated fungi (genus or higher levels) show the same 
type of colonisation and nutrition in all their species. Whereas the rusts and 
powdery mildews only include obligate biotrophs, genera like Botrytis, 
Cochliobolus, Alternaria and many others comprise necrotrophic species. 
Other fungal taxa exhibit rather a range of nutritional styles when compar- 
ing different species. Such groups offer the opportunity to identify the genes 
defining nutritional strategies and to study their evolution within a re- 
stricted phylogenetic context. An example of contrasting life styles within 
a single genus is Cladosporium, where saprotrophic, epiphytic, endophytic, 
facultative biotrophic or even mycoparasitic species have been described 
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(David 1997; Moricca et al. 1999; Wirsel et al. 2002). Also, the genus 
Colletotrichum exhibits diverse colonisation strategies ranging from asymp- 
tomatic endophytic or epiphytic non-pathogenic fungi to hemibiotrophic 
or necro trophic plant pathogens (Perfect et al. 1999; Freeman et al. 2001; 
Latunde-Dada 2001). Making definitions even more complicated, this can 
vary for certain Colletotrichum species, depending on the colonised host 
species (Horowitz et al. 2002). Initial genetic evidence indicates that nutri- 
tional styles within this genus might be defined by just a few genes as shown 
for Colletotrichum magna> where mutations at single loci could change its 
mode from a hemibiotrophic pathogen to a non-pathogenic, mutualistic 
endophyte (Freeman and Rodriguez 1993; Redman et al. 1999). It remains 
to be shown which environmental factors might in addition influence this 
apparently finely tuned balance. In the following three paragraphs we will 
introduce one well-studied example for each of the most important nutri- 
tional types to demonstrate that plant pathogenic fungi have evolved quite 
different strategies to retrieve nutrients from their hosts after they have 
penetrated the plant surface (as outlined in Sect. 2). 



3.1 Obligate Biotrophy (The Rusts) 

This nutritional mode is typified by several remarkable features. First, in 
extant organisms it is only found in three phylogenetically very distant taxa, 
namely the rusts and powdery mildews, which are higher fungi, and in the 
downy mildews, which belong to the stramenopila that also include diatoms 
and some other groups of algae. 

Second, one common specialised cell type, the haustorium, exists that 
evolved most likely independently in the ancestors of these organisms. 
After the early differentiation processes on the host surface culminating in 
appressorial development and the entering of the interior of the leave 
(Fig. 2B), dikaryotic hyphae of rusts grow in the apoplastic space. Unknown 
signals lead to the differentiation of haustorial mother cells that closely 
adhere to mesophyll cells and that breach the host wall with a penetration 
peg. Inside the mesophyll cell a terminal, globose haustorium is formed 
without disrupting the host plasma membrane (Fig. 2A). The extrahausto- 
rial matrix separates fungal and host plasma membranes, and this interface 
allows exchange of metabolites and putatively also of molecular signals. 
Apparently, obligate biotrophs have recruited and modified existing mor- 
phogenic programs since haustorial mother cell differentiation shows some 
similarity to that of appressoria. In both cases terminal swellings of hyphae 
are used to initiate penetration of the rigid host cell wall. It is currently 
uncertain if haustorial mother cells are able to generate great osmotic 
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pressure as demonstrated for appressoria of several fungi (see Sect. 2.1). In 
contrast, it has been indicated for Uromyces fabae that secreted hydrolases 
seem to locally weaken or even dissolve that area of the wall where the 
penetration peg grows through (Deising et al. 1995a,b). 

Although it has been assumed for quite a while that haustoria serve as 
organs for absorption of nutrients from the host, it was only recently that 
molecular data have become available which indicated how this process 
operates. Several cDNAs encoding plasma membrane-localised trans- 
porters were recovered from a library enriched for haustorially expressed 
mRNAs (Hahn and Mendgen 1997). 

Two amino acid permeases were analyzed in greater detail. AATlp seems to have a rather 
broad substrate spectrum with highest activities for histidine and lysine as shown by 
heterologous expression in yeast and Xenopus oocytes (Struck et al. 2002). Its transcription 
does not appear to be under strict developmental control whereas that of the second putative 
amino acid permease, AAT2p, is (Hahn et al. 1997b). AAT2p was localized at haustorial 
plasma membranes by immunofluorescence microscopy, but its substrates are not yet 
known. The expression of a gene encoding a hexose transporter, HXTlp, is also develop- 
mentally regulated (Voegele et al. 2001). Heterologous expression experiments revealed that 
HXTlp specifically transports glucose and fructose. Immunofluorescence and electron 
microscopy have indicated that the localisation of HXTlp is restricted to the haustorial 
plasma membrane. A model for proton symport across the haustorial plasma membrane 
has been proposed for these transporters (Hahn and Mendgen 2001). A gene, PMA1 , 
encoding a plasma membrane H + ATPase that could energise this process has been cloned 
(Struck et al. 1998b). 

Molecular evidence revealed that the regulation of several host genes for 
amino acid and carbon metabolism was also affected during biotrophic 
growth of rust (Wirsel et al. 2001). Alteration of host gene expression was 
not only observed in infected leaves, but also detectable in all vegetative 
organs, which might reflect changes in source-sink relationships in the 
whole plant. Apparently, through the differentiation of haustoria, obligate 
biotrophs have evolved an efficient way of retrieving nutrients from the host 
by preferentially localising at least some of their amino acid and carbohy- 
drate transporters at the haustorial membrane. 

A third remarkable feature of obligate biotrophy is the fact that this 
plant-fungal association is active for up to several months without visible 
host defense reactions occurring. Obviously, since obligate biotrophs de- 
pend on living cells, they must prevent the elicitation of the HR when the 
haustorial mother cell contacts a mesophyll cell and establishes the haus- 
torium. Theoretically, obligate biotrophs can evade host defense reactions 
by at least two strategies for both of which only limited experimental 
evidence is available yet. First, they could prevent the expression of or 
modify those molecules that could be recognised by the surveillance sys- 
tems of innate plant immunity. Work that had been carried out with 
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dikaryotic Uromycesfabae shows that those cell types differentiating on the 
host leaf contain chitin as a prominent wall polymer (Deising and Siegrist 
1995). Chitin, in contrast to citosan, is a target for plant-derived chitinases 
that generate highly elicitor-active chitin-oligomers (Ride and Barber 
1990). Those infection structures that differentiate within the host leaf seem 
to have at least partially converted the surface-associated chitin into chi- 
tosan by means of chitin deacetylase. Chitosan is assumed to have reduced 
or no elicitor-activity (e.g. Vander et al. 1998). Labelling patterns of infec- 
tion structures of several rust species suggest that infection-related modi- 
fication of chitin is critical to the success of the pathogen (Deising et al. 1996; 
El Gueddari et al. 2002). A second strategy probably used by obligate 
biotrophs to prevent host defense reactions could involve active suppres- 
sion, e.g. via affecting host signalling pathways somewhere between initial 
recognition of the pathogen and the final defense response. Genetic evi- 
dence comes from analysis of the maize lethal leaf spot 1 mutant ( llsl ) that 
forms spreading necrotic lesions, conferring enhanced resistance to fungal 
pathogens. 

It was suggested that the wild-type gene Llsl suppresses cell death by preventing reactive 
oxidative species formation or removing a cell death mediator (Gray et al. 1997). Two fungal 
pathogens that exhibit compatible interactions with maize, one necrotroph, Cochliobolus 
heterostrophus race O (see below), and one obligate biotroph, Puccinia sorghi, were com- 
pared on llsl plants (Simmons et al. 1998). Both fungi were affected at developmental stages 
on the leaf surface, which was reflected by reduced rates of germination and appressorium 
formation. Whereas C. heterostrophus was severely impeded to spread through host tissues, 
P. sorghi still produced pustules at somewhat reduced frequency. Apparently, in cases where 
P. sorghi successfully penetrated, it spread without eliciting llsl lesions. 



The ability of rust fungi to intersect plant defense seems to be host specific, 
since they elicit various defense responses in non-host plants (Heath 1997). 
Recent data suggest, that plasma membrane-cell wall adhesion is essential 
for the expression of cell wall-associated responses (Mellersh and Heath 
2001 ). 

In its monokaryotic form Uromyces vignae infects susceptible cultivars of its host cowpea 
by direct penetration of epidermal cells through appressoria. It has been shown for this 
interaction that disruption of the membrane-wall adhesion occurred exactly where the 
fungus penetrated the host cell and that this event suppressed further defense reactions 
(Mellersh and Heath 2001). 



3.2 Necrotrophy ( Cochliobolus spp.) 

As the name implies, fungi exhibiting necrotrophy kill host tissues and use 
the contents of dead or dying cells along with the digestion products of 
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polymeric cell wall materials for nutritional purposes. Two main features 
are known to characterise this life style, first, the secretion of large amounts 
of lytic enzymes degrading the constituents of plant cell walls, i.e. cellulases, 
pectinases, xylanases, proteases and in some cases lignin-degrading en- 
zymes as well. Most of these enzymes are encoded by multigene families 
that confer functional redundancy to the system and make the analysis of 
their individual importance for pathogenicity by gene deletion approaches 
quite cumbersome as described in Section 2. 

A second well-investigated feature of necrotrophic fungi is the produc- 
tion and secretion of phytotoxins that kill host cells in advance of the 
growing hyphae, thereby reducing the impact of host defense on the patho- 
gen. Since most molecular investigations on fungal toxins have been carried 
out with Cochliobolus species, we focus the following discussion on this 
genus. 

Over the years, C. heterostrophus (anamorph names: Helminthosporium 
maydis , Drechslera maydis and Bipolaris maydis) has become one of the 
best-studied systems for the evolution of fungal virulence factors. This 
fungus causes a disease called southern corn leaf blight and two races, race 
0 and race T, were isolated from infected corn fields. Race T had been 
detected in the USA first in 1969 on corn cultivars with the Texas male sterile 
cytoplasm, which had been widely planted by then (Hooker 1974). It caused 
severe epidemics until it was recognized that the specific genetic constitu- 
tion of the plant was apparently responsible for the spreading of this new 
race of the pathogen. By changing to host cultivars that did not rely on the 
Texas cytoplasm for producing hybrid male sterile seeds, C. heterostrophus 
was not a threat to corn crops any longer. Only race T produces T-toxin, 
which is a blend of linear polyketoles with a length of C35 to C41 (Kono and 
Daly 1979). All of them produced - when applied in pure form - large 
chlorotic lesions on Texas cytoplasm corn as did the fungus. T-toxin is a 
virulence factor, since race 0 which does not produce T-toxin is a milder 
pathogen causing only small necrotic lesions. 

The molecular target identified to bind T-Toxin is a 13-kDa protein (urfl3p) that is only 
found in the inner mitochondrial membrane of Texas cytoplasm corn but not in cultivars 
with the N-cytoplasm (normal cytoplasm) (Levings 1990). A mitochondrial locus, T-urfl3 , 
that had been selected for during the breeding for male sterility, encodes urfl3p (Dewey et 
al. 1986; Wise et al. 1987). Therefore, plant breeders inadvertently introduced a strong 
selective advantage to a fungal race that was less competitive on the N-cytoplasm cultivars 
planted before. urfl3p forms oligomers, probably tetramers, within the inner mitochondrial 
membrane (Rhoads et al. 1995). After binding of T-toxin, the complex changes its confor- 
mation, resulting in the creation of pores through which small molecules can diffuse thereby 
leading to the collapse of respiratory functions and cell death. 

Classical genetic analyses have demonstrated a 1:1 segregation of parental phenotypes 
in a cross between race 0 and race T, suggesting that a single locus ( Toxl ) is responsible for 
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producing T-toxin (Tegtmeier et al. 1982). It was also shown that Toxl is closely linked to a 
reciprocal translocation breakpoint (Bronson 1988; Chang and Bronson 1996). Sophisti- 
cated genetic analysis and electrophoretic karyotype comparisons of race 0 versus race T 
revealed that in the latter Toxl is actually spread over both of those breakpoints that led to 
their designation as Toxl A and ToxlB (Kodama et al. 1999). Cloning of the first of the genes 
involved in T-toxin production took advantage of the REMI (restriction enzyme-mediated 
integration of DNA) procedure (Lu et al. 1994). ChPKSl encodes a multifunctional 
polyketide synthase that is composed of six domains theoretically permitting to carry out 
most of the synthetic steps necessary for T-toxin production. While ChPKSl is located at 
Toxl Ay two linked genes for T-toxin production, ChDECl (acetoacetate decarboxylase) and 
ChREDl (medium-chain reductase), could be cloned from ToxlB (Rose et al. 2002). At both, 
ToxlA and ToxlB , probably as yet undiscovered genes are located that are involved in 
T-toxin production. Several unusual features of the three cloned genes suggested a horizon- 
tal gene transfer event in a race 0 progenitor to explain the appearance of race T. These genes 
exhibit a lower GC content than the average Cochliobolus gene, a codon usage deviating from 
that typically found in filamentous fungi, and they are lacking in race 0 and other species of 
Cochliobolus or related genera analyzed. However, the origin and the timing of the horizontal 
DNA transfer remain unknown. 

A second Cochliobolus species, G carbonum (anamorph names: Helmin- 
thosporium carbonum and Bipolaris zeicola) has been particularly well 
analysed with respect to toxin production. G carbonum comes also in two 
races; race 1 is highly virulent on certain susceptible corn cultivars causing 
a disease called northern corn leaf spot, whereas race 2 has only limited 
potential as a pathogen. Only race 1 produces a phytotoxin, HC-toxin, 
which genetically cosegregates with a single locus, Tox2 (Yoder 1980; Bron- 
son 1991). HC-toxin is a cyclic tetrapeptide (cyclo D-Pro, L-Ala, D-Ala, 
L-Aeo; Aeo abbreviates 2-amino-9,10-£poxy-8-nxodecanoic acid) that is 
non-ribosomally synthesised (Walton et al. 1982). HC-toxin is a patho- 
genicity factor because wild-type strains or mutants at Tox2 are practically 
non-pathogenic. Susceptibility to race 1 occurs when maize carries ho- 
mozygous mutations at two unlinked loci, Hml and Hm2 , conferring the 
same function. It was suggested that this genotype ( hml/hml ; hm2/hm2) is 
a result of inbreeding and arose inadvertently in corn breeding programs 
in Kansas. C. carbonum was supposedly not prevalent or not a competitive 
pathogen under the dry conditions of that area which prevented disease 
outbreak (Multani et al. 1998). Transfer of this genotype to cultivars planted 
in the warm and humid corn belt of the USA, where C. carbonum had been 
present before, resulted in disease in the late 1930s. 

Only the extracts from resistant but not from susceptible cultivars contained enzymatic 
activity, HC-toxin reductase (HCTR) that could eliminate the toxic potential in vitro (Meeley 
et al. 1992). The gene at the HM1 locus has been cloned by a transposon tagging procedure. 
Its sequence showed similarity to dihydroflavonol-4-reductase genes from several plants 
(Johal and Briggs 1992). Some susceptible cultivars have been shown to carry transposon 
insertions at hml and deletions at hm2 (Multani et al. 1998). Interestingly, HCTR activity 
has been detected in other monocots. Homologues of HM1 and HM2 were discovered at 
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syntenic positions in several other grasses but not in dicots (Multani et al. 1998). One HCTR 
homologue was cloned from barley and found to be highly similar to that of corn (Han et 
al. 1997). Apparently, monocots use an ancient defense mechanism for detoxifying fungal 
cyclic peptides and resistance provided by HCTR collapsed in the case of maize when new 
genotypes of the pathogen and the host interacted. 

HC-toxin shows cytostatic activity towards susceptible host cells in contrast 
to the cytotoxic activity exhibited by T-toxin and other fungal phytotoxins. 
Pharmacological studies revealed that HC-toxin reversibly inhibits not only 
corn, but also mammal and oomycete histone deacetylase (Brosch et al. 
1995). It was discussed that instead of killing host cells, HC-toxin rather 
prevents the modification of chromatin, which seems to be necessary to 
coordinate the transcription of plant defense genes. Again, it should be 
noted that in maize and probably other monocots the wild-type condition 
is resistance to HC-toxin, meaning that the plant counteracts a putative 
suppression of defense reactions by modifying the fungal toxin. 

Electrophoretic karyotype analyses and physical mapping demonstrated 
that the genetic locus in C. carbonum race 1 responsible for HC-toxin 
production, Tox2 , comprises more than 540 kb and indicated that three 
genes known to be involved in toxin production were present in multiple 
copies. The lack of Tox2 in the genome of race 2 and suppression of 
recombination were discussed as reasons for the segregation of Tox2 as a 
single unit in crosses between race 1 and race 2 (Ahn and Walton 1996). 
A refined map of Tox2 showed that all copies of the genes known by then, 
with the exception of one, are combined in a huge gene cluster at that locus 
(Ahn et al. 2002). Currently, seven genes at Tox2 have been cloned and 
analysed. 

Race 1 carries two copies of the first cloned gene at Tox2, HTS1, and it was shown that loss 
of toxin production and pathogenicity occurred only after deleting both (Panaccione et al. 
1992). Sequence analysis of HTS1 revealed an intronless 15.7-kb open reading frame encod- 
ing a large cyclic peptide synthetase containing four domains. Each domain seems to catalyse 
aminoacylation and thioesterification steps for one of the four amino acids present in 
HC-toxin. In addition, an epimerisation activity was also presumed within HTS1 (Scott- 
Craig et al. 1992). Interestingly, the HTS1 regions are flanked by repetitive elements that 
exhibit similarity to fungal transposons of the Fotl family (Panaccione et al. 1996). Besides 
HTS1 additional genes appeared to be present in multiple functional copies at Tox2. TOXA 
is found in two copies, each closely linked to one of the HTS1 copies. It encodes a putative 
efflux pump for HC-toxin, which is thought be responsible for secretion of the toxin and 
self-protection of the fungal cell, since deletions were only obtained when targeting one but 
not both copies (Pitkin et al. 1996). A mutant with deletions in all copies of TOXC was not 
able to produce HC-Toxin and was non-pathogenic. TOXC showed similarity to fatty acid 
synthases and it was proposed to be necessary for the synthesis of L-Aeo (Ahn and Walton 
1997). Currently, the function of TOXD remains unknown. TOXE encodes a transcription 
factor whose proposed function is to coordinate the regulation of the TOX genes. TOXE has 
a unique modular structure, i.e. a bZIP basic DNA binding domain and four ankyrin repeats 
but no leucine zipper or helix-loop-helix domains. Deletion of TOXE resulted in a mutant 
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lacking the capability to produce HC-toxin and to infect the host, indicating that it is 
pathway-specific (Ahn and Walton 1998). A ten-base motif (tox-box) had been located in 
the promoters of all TOX genes except TOXE itself and it was demonstrated that TOXEp 
binds in vitro and in vivo to the tox-box (Pedley and Walton 2001). The TOXF derived 
product shares similarity with branched-chain amino-acid transaminases and it was sug- 
gested to aminate a precursor of L-Aeo. A mutant with disruptions in all TOXF copies did 
not produce toxin and was non-pathogenic (Cheng et al. 1999). The last of the currently 
known TOX genes, TOXG, encodes an alanine racemase that provides the D-Ala found in 
HC-toxin. Disruptions of both TOXG copies resulted in a mutant still making a side product 
of the pathway that retained some toxic activity. This mutant exhibited only slightly reduced 
virulence in infection assays (Cheng and Walton 2000). 

In conclusion, the two examples from the genus Cochliobolus detailed 
above display some resemblances. Genes responsible for producing host- 
selective fungal toxins have probably been acquired by horizontal gene 
transfer from presently unknown sources. These genes are organised into 
large gene clusters that are probably under a common regulatory control, 
in analogy to other fungal secondary metabolite pathways. In both cases a 
relatively benign pathogen turned highly virulent when meeting susceptible 
host plants. The latter became only available in the field after traditional 
breeding programs introduced new host cultivars carrying an altered trait. 
These cases also highlight the intricate interaction between the genomes 
and the physiology of fungal pathogens and their hosts. 



3.3 Hemibiotrophy ( Colletotrichum spp.) 

As mentioned above, Colletotrichum harbours species with various nutri- 
tional modes. However, the majority of species seem to follow the hemibio- 
trophic mode. This mode involves two phases, a biotrophic phase which is 
followed by a necrotrophic phase that finally leads to the destruction of 
colonised host tissues. Biotrophy lasts from about 1 to 3 days, depending 
on the species, and is therefore much shorter than that of obligate bio- 
trophs. Colletotrichum species provide an easier access than obligate bio- 
trophs to study biotrophic interactions by molecular approaches since they 
can easily be cultivated in vitro and transformed. Once the total genome 
sequence of such an organism will be available, it will be a good system to 
directly compare by microarray technology biotrophic and necrotrophic 
nutritional modes in the same genetic background. This information might 
allow the identification of those genes defining these nutritional styles. 

As outlined in Section 2, hemibiotrophic Colletotrichum spp. directly 
penetrate into epidermal cells by appressoria and infection hyphae (Fig. 1). 
The latter differentiate a specialised cell type, the infection vesicle, whose 
shape varies from spherical to branched, depending on the species 
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(Fig. 2C). This structure is similar to the haustorium of obligate biotrophs, 
and again the invaginated host cell stays alive and a special interface is 
created between the plasma membranes of the host and the pathogen. In 
some species like C. truncatum and C. graminicola , the biotrophic phase 
appears to be restricted to the first infected epidermal cell and is terminated 
when thin secondary hyphae emerge from it to invade surrounding cells 
that are killed upon (localised biotrophy) (O’Connell et al. 2000; Latunde- 
Dada 2001). If biotrophic primary hyphae arise from infection vesicles in 
these species they do not proliferate into neighbouring cells but rather 
remain in the infected epidermal cell. In other species, like C. linde- 
muthianum and C. sublineolum , the biotrophic phase expands with thick 
primary hyphae invading neighbouring cells. The biotrophic phase is there- 
fore slowly spreading through host tissue and since biotrophy is also here 
transient in nature, cells that were infected earlier successively become 
necrotic (sequential biotrophy) (O’Connell et al. 2000; Latunde-Dada 2001 ). 

Hemibiotrophic Colletotrichum spp. in their biotrophic phase do not 
appear to be recognised and rejected by their hosts defense systems, similar 
to the situation with powdery mildew or rust fungi. Also here, evidence is 
emerging that modification of fungal surfaces and active suppression of 
host pathways might be involved to escape defense. As is the case for 
Uromycesfabae and Puccinia graminis f. sp. tritici , C. graminicola seems to 
modify its chitin layer to chitosan as soon as it penetrates the epidermal cell 
as shown by fluorescently labelled wheat germ agglutinin detecting chitin 
and antibodies recognising chitosan (El Gueddari et al. 2002). A cDNA 
encoding a protein with similarity to cell-wall-associated receptor kinases 
was cloned from C. gloeosporioides cultures starved for nitrogen (Stephen- 
son et al. 2000). Expression of CgDN3p inplanta was highest in biotrophic 
infection vesicles as demonstrated by fusions to GFP. Interestingly, deletion 
mutants in that gene elicited an HR-like phenotype on a susceptible host 
where this is normally not occurring. In other words, wild-type C. 
gloeosporioides seems to suppress HR in a compatible interaction. 

In another investigation, expression of PR- 10 and chalcone synthase of sorghum was 
compared by Northern analyses in two incompatible interactions. The particular cultivar 
used displayed host-specific resistance to the tested strain of hemibiotrophic C. sublineolum , 
a species that is pathogenic to sorghum, and it displayed non-host resistance towards 
necrotrophic Cochliobolus heterostrophus (Lo et al. 1999). When inoculated with C. hetero- 
strophus , transcripts of both genes became detectable after 4 to 6 h, whereas this took 36 to 
48 h after inoculation with C. sublineolum. Maximum transcript levels were similar in both 
interactions. It was speculated that the delayed induction of these defense related genes in 
the interaction with a hemibiotrophic fungus might be based on the (partial) suppression 
of host defense mechanisms. Unfortunately, there was no host cultivar included that was 
fully susceptible to the C. sublineolum strain tested and therefore it is uncertain if in this 
case the induction of defense genes would be weaker and/or more delayed. Cytological 
comparison of the two interactions indicated that development of C. heterostrophus was 
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blocked at the appressorial stage whereas C. sublineolum penetrated normally with appres- 
soria and also developed infection vesicles. This fungus was arrested when it started to 
proliferate into neighbouring cells (Lo et al. 1999). 

The switch from biotrophic to necrotrophic growth is under genetic regu- 
lation as was demonstrated for C. lindemuthianum. A tagged mutant in the 
gene CLTA1 was recovered in a screen for reduced pathogenicity. The 
mutant elicited an HR-like phenotype on a susceptible cultivar of common 
bean (Dufresne et al. 2000). This phenotype also occurred when inoculating 
the wild type onto a non-susceptible host cultivar. Cytology showed that 
the mutant was arrested in the biotrophic stage since it appeared to differ- 
entiate normally from conidial germination to infection vesicles and to 
biotrophic primary hyphae. However, secondary hyphae never developed. 
CLTAlp has sequence similarity to transcription factors of the zinc cluster 
(Zn[II](2)Cys(6)) family, which typically regulate distinct metabolic path- 
ways in fungi. It was speculated that CLTAlp might switch on genes 
necessary for necrotrophic nutrition (Dufresne et al. 2000). In any case, this 
gene appears to be important for regulating the transition in life style and 
it will be interesting to identify those genes, which are under its control. 
Recently, another mutant, which remained arrested in the biotrophic phase 
had been identified in C. graminicola by REMI mutagenesis (Thon et al. 
2002). The tagged gene, CPR1 , putatively encodes a subunit of the eu- 
karyotic signal peptidase complex. A deletion of the gene was not achieved, 
indicating that it might be essential for growth and it appeared that the 
original mutant had only been recovered since the insertion occurred in the 
3’-untranslated region close to the stop codon. The mutant exhibited re- 
duced transcript levels, which was taken as evidence for an inability to 
secrete sufficient amounts of hydrolytic enzymes during necrotrophic 
growth, which would be not essential for biotrophic growth. 

Thirteen infection specifically expressed sequence tags (ESTs) corresponding to genes 
activated during early stages of the interaction between C. graminicola , and its host, lea 
mays , have been isolated by sequential subtractive hybridisation. Identification of the 
deduced proteins and RT-PCR studies suggest that considerable reprograming of the protein 
expression patterns occurs during the shift from biotrophic to necrotrophic infection 
phases. In addition, ESTs that share no similarity with known sequences in the database, 
thus representing hitherto unknown genes, have also been isolated (Sugui and Deising 2002). 
Functional analysis of these will show whether or not they play a role in the C. graminicola 
- maize interaction. 

Necrotic lesions are typically characterised by anthracnose and blight 
symptoms. Cell wall-degrading enzymes - as discussed in Section 2 - and 
in at least some species also phytotoxins are produced, which leads to the 
killing of host cells in front of the spreading hyphae. Toxin production has 
been reported for C. falcatum (Naik and Vedamurthy 1997), C. Unde- 
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muthianum (Fernandez et al. 2000), C. dematium (Yoshida et al. 2000), C. 
gloeosporioides (Barjau et al. 1995; Jayasankar et al. 1999), C.fuscum (Good- 
man 1960) and C. acutatum (Jayasinghe and Fernando 2001). To our 
knowledge, the molecular basis of toxin production by Colletotrichum is 
not known in any of these cases. 



4 Infection Structures as Fungicide Targets 

Depending on the crop, fungal plant pathogens cause significant pre- and 
post-harvest losses worldwide, which may amount to 20% or more. Accord- 
ingly, fungicides contribute approximately 20% to the world market of 
agrochemicals, corresponding to 6 billion $US spent on plant protection 
each year (Hewitt 1998). Although several fungicide classes differing in 
their mode of action have been established, increasingly stringent biosafety 
and specificity requirements, together with occurrence of fungicide resis- 
tance in field populations of fungal plant pathogens, make the development 
of new fungicides with novel modes of action necessary (FRAC, Fungicide 
Resistance Action Committee 2002, http://www.frac.info/frac.html; Deising 
et al. 2002, and references therein). Technical progress and increased 
knowledge of fungal biology allow one to search for specific targets in 
fungal cells, thus broadening classical fungicide development consisting of 
random synthesis of chemicals in combination with screening for antifun- 
gal activity (Knight et al. 1997). A promising alternative that may circum- 
vent time consumption and reduce financial efforts is to identify fungicides 
produced by microbes. A well-known example is strobilurin A of Strobilu- 
rus tenacellus (Anke et al. 1977; Kraiczy et al. 1996). Although the microbial 
fungicide has been shown to be unstable in the light, chemical modification 
has led to several very successful fungicides that are well established on the 
market. Not only in fungal mycelia, but also in fruiting bodies substances 
with antifungal properties have been identified (Lorenzen and Anke 1998; 
Stadler and Sterner 1998), and it will be challenging to develop screening 
methods that allow one to identify not only activities directed against basic 
metabolism occurring in both saprophytic and pathogenic hyphae, but also 
against targets specifically needed for infection structure differentiation. 
Thus, infection structures (see above) may be regarded as excellent fungi- 
cide targets (Struck et al. 1998a), and some examples for this disease control 
strategy already exist. 

Following landing on the plant surface, fungal spores adhere (Nicholson 
1996) and germinate under permissive conditions. Germ tube growth and 
- in many cases - appressorium formation occurs, preceding invasion of 
the plant tissue. These developmental stages are targets of protective (or 
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pre-invasion) fungicides. Fungicides that interfere with germ tube growth 
are multi-site inhibitors structurally classified as dithiocarbamates (e.g. 
mancozeb, maneb or thiram) or phthalimides (e.g. captan and folpet). The 
dialkyldithiocarbamate anion either interferes with enzymes containing 
copper ions such as laccases or ascorbate oxidases, or binds to SH-groups 
and thus inhibits protein function. Binding of SH-groups has also been 
reported for phthalimide derivatives. 

Germination and germ tube growth are also inhibited by strobilurins, 
which are present in several modern fungicide mixtures. These inhibitors 
block mitochondrial electron transport via binding to ubiquinol oxidase of 
the cytochrome bci complex. Like the strobilurins, the antifungal com- 
pound cyazofamide inhibits the mitochondrial electron transport. Cyazo- 
famide is a specific inhibitor blocking ubiquinol reductase. Both fungicide 
groups thus efficiently interfere with mitochondrial ATP synthesis, which 
is essential for invasion of the plant and pathogenic development. An 
interesting group of fungicides are the anilino pyrimidins, as they not only 
inhibit methionine biosynthesis. Some members of this group, e.g. ando- 
prim, mepanipyrim and pyrimethanil reduce secretion of depolymerising 
enzymes such as pectinases, cutinases, lipases or proteases, some of which 
have been shown to be virulence factors (see above). This mode of action is 
particularly valuable to control pathogens that cause extended tissue mac- 
eration, e.g. B. cinerea infecting strawberries or C. gloeosporioides , Monilia 
fructigena> Penicillium expansum and P . italicum causing apple rot. 

Two groups of fungicides specifically interfere with two different targets 
of the melanin biosynthesis pathway and thus specifically interfere with 
appressorium function (Fig. 1 ). The rice blast fungus M. grisea and different 
Colletotrichum species synthesise melanin via 1,8-dihydroxynaphthalene 
(1,8-DHN) (Fig. 1; Kubo and Furusawa 1991), and reductase and dehy- 
dratase inhibitors are used as fungicides. The NADPH-dependent reductase 
that converts 1,3,6,8-tetrahydroxynaphthalene (1,3,6,8-THN) to scytalone 
and 1,3,8-trihydroxynaphthalene ( 1,3,8-THN) to vermelone is inhibited by 
isobenzofuranone, pyrrolquinoline and triazolbenzothiazole compounds, 
with tricyclazole being the most prominent fungicidal substance. The de- 
hydratase converting scytalone to 1,3,8-THN and vermelone to 1,8-DNH, 
respectively, are inhibited by the cyclopropanecarbroxamide fungicide 
carpropamid (Kurahashi and Pontzen 1998; Thieron et al. 1998). Inhibition 
of melanin synthesis results in reduced turgor pressure and reduced gen- 
eration offeree not sufficient for mechanical breaching of the host cell wall. 
Interestingly, as shown with melanin-deficient M. grisea isolates the dehy- 
dratase inhibitor and rice blast fungicide carpropamid not only blocks 
melanin biosynthesis, but also stimulates resistance responses of rice plants 
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(Thieron et al. 1998). The dual mode of action may help to avoid the 
development of fungicide resistance (Deising et al. 2002). 

Infection structure differentiation is initiated, as soon as the germ tube 
growing on the plant surface perceive signals typical of its host (Hoch et al. 
1987; Jelitto et al. 1994; Kolattukudy et al. 1995). Compounds involved in 
fungal signal transduction (Dean 1997; Deising et al. 2000) would be excel- 
lent fungicide targets, as inhibition of signal cascades would specifically 
inhibit pathogenic fungi that rely on infection structure differentiation. 
Interestingly, phenyl pyrrol fungicides such as fenpiclonil and fludioxonil 
inhibit MAP kinases, and the quinoline quinoxyfen interferes with the 
G-protein-mediated signal transfer (FRAC 2002, http://www.frac.info/ 
frac.html; Wheeler et al. 2000). As a consequence of application of the 
mildew-specific fungicide quinoxyfen, the fungus does not recognise its 
host plant surface, germ tube growth continues and appressorium differ- 
entiation and plant infection do not occur (Wheeler et al. 2000; Deising et 
al. 2002). 

Another inhibitor, called glisoprenin A, interfering with signal transduc- 
tion and appressorium formation has been isolated from submerged cul- 
tures of Gliocladium roseum (Thines et al. 1997). This compound inhibited 
appressorium formation of M. grisea on hydrophobic surfaces, resembling 
natural conditions, but infection structure induction by chemicals such as 
cAMP, 8,4-chlorophenylthio-adenosine-3’,5’monophosphate, 3-isobutyl- 
1-methylxanthine or the plant wax compound 1,16-hexadecanediol, was 
not affected. These data showed that two different signal transduction 
pathways for appressorium formation exist in M. grisea. Again, inhibitors 
isolated from micro-organisms offer a rich source of lead compounds for 
fungicides. 

The most prominent structural compounds of fungal hyphae are chitin 
and |3-l,3-glucans, which interconnect to form a rigid network determining 
hyphal shape (Wessels 1993). As most fungi contain significant amounts of 
these polymers, and as even cellulosic oomycetes synthesise chitin (Werner 
et al. 2002) and (5-1,3-glucans, chemicals that inhibit the synthesising en- 
zymes maybe excellent fungicides (Debono and Gordee 1994; Current et al. 
1995; Knight et al. 1997). These fungicides inhibit growth of vegetative and 
pathogenic hyphae. The polyoxin chitin synthase inhibitors have been used 
since 1967 to control Rhizoctonia solatii, Cochliobolus miyabeanus and 
Alternaria alternata, and glucan synthase inhibitors are important tools to 
control human invasive mycoses caused by Cryptococcus neoformans, As- 
pergillus fumigatus or Pneumocystis carinii (Debono and Gordee 1994; 
Urbina et al. 2000). 

Like cell wall biosynthesis inhibitors, sterol biosynthesis inhibitors 
(SBIs) are general fungicides in the sense that they do not discriminate 
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between pathogenic and saprophytic hyphae (Deising et al. 2002). Both 
groups of SBIs, C14-demethylase inhibitors and morpholines which block 
A 7 ^A 8 -isomerase, are easily taken up by plants and are thus systemic 
fungicides. SBI fungicides interfere with plasma membrane function. Im- 
portantly, these compounds are ideally suited to access inplanta growing 
pathogenic hyphae as well as infection structures growing on the plant 
surface (Deising et al. 2002, and literature therein). 



5 Concluding Remarks 

In this chapter, we have discussed several aspects of fungal morphogenesis, 
function of infection structures and different life styles of diverse plant 
pathogens. During the last years, several genes essential for efficient host 
invasion and colonisation have been identified. However, although many 
details are known, there still is a considerable lack of information, e.g. in 
regulatory control of morphogenesis, cell wall biogenesis and modification, 
and different metabolic pathways involved in synthesis of virulence or 
pathogenicity factors. Knowledge of factors required to induce infection 
structure differentiation may help to find highly specific antifungal agents 
that could be used as fungicides. Such compounds would be particularly 
interesting, as they hinder infection at an early stage and disease symptoms 
would not occur. As resistance against several of the most frequently used 
fungicides (strobilurins and SBIs) has been observed in different pathogens, 
discovery of new compounds is important not only in agriculture, but also 
in medicine. 
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Biotechnology: 

Production of Proteins for Biopharmaceutical 
and Industrial Uses in Transgenic Plants 

Kerstin Stockmeyer and Frank Kempken 



1 Introduction 

Recent advances in plant molecular biology have opened new strategies for 
the production of genetically engineered plants and for the precise transfer 
of new genes into crop plants from diverse sources. Many genes have been 
transferred through various transformation techniques including genes for 
several agronomically important traits such as herbicide resistance, disease 
and insect resistance. 

Over the last 8 to 1 0 years, it has even become apparent that plant systems 
may be particularly valuable for the expression and production of recom- 
binant proteins. Foreign genes are integrated into the plant genome and 
therefore traditional breeding techniques can be used to generate trans- 
genic seed stocks for easy and stable storage or distribution. 

Plant cells are able to synthesize, target and process complex mammalian 
proteins in a very similar manner to the natural hosts. Therefore, they are 
an attractive alternative to bacterial or mammalian systems for the produc- 
tion of recombinant proteins. 

Much experience has been accumulated in the manufacture of biophar- 
maceuticals in animal and bacterial cells, as well as in the biology and 
genetics of crop plants. Elements from these different disciplines can be 
brought together for the design of optimal production strategies. 



2 Transgenic Plants as Bioreactors 

Transgenic plants are attracting interest as efficient bioreactors for the safe 
and inexpensive production of large amounts of functional, recombinant 
therapeutically valuable proteins, such as blood substitutes, vaccines and 
antibodies. The production of recombinant blood factors for therapeutic 
use, such as antibodies or recombinant human hemoglobin, in transgenic 
plants has become an alternative to the isolation of these molecules from 
natural or other recombinant sources (Theisen 1999). Plant bioreactors 
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offer the possibility of an inexpensive large-scale production of high vol- 
umes of recombinant proteins with increased safety concerning contami- 
nations with human pathogens. 

Bacteria have often been the expression system used for the production 
of recombinant proteins, with low production costs, their straightforward 
molecular biology and easy handling (Olins and Lee 1993). Complex mam- 
malian proteins can also be produced in transformed plants or transformed 
plant suspension cells (Fischer et al. 1999a). Plants are attractive for the 
production of pharmaceutical proteins on a field scale because the ex- 
pressed proteins are functional and almost identical to their mammalian 
counterparts. Regarding protein folding and structure, small peptides, 
polypeptides and even complex proteins can be expressed in plants that are 
fully assembled (Fischer and Emans 2000). 

The first transgenic plants were reported in 1983 (Fraley et al. 1983; 
Zambryski et al. 1983). Many recombinant proteins have been expressed in 
several important agronomic plant species including tomato, potato, to- 
bacco and corn since then (Austin et al. 1993). Examples are given in 
Table 1. 

Bacterial or mammalian expression systems have significant limitations. 
Bacteria cannot perform the complex posttranslational modifications that 
are required for bioactivity of many human proteins and high-level expres- 
sion often leads to accumulation of insoluble protein aggregates. While 
mammalian cell cultures perform the required protein modifications, in- 
stability of selected cell lines, low transgene expression levels, and the 
difficulties and high expense of scale-up are often limiting or severely 
impact cost. Furthermore, animal tissues bare potential viral or infectious 
agent contamination risks and allergic reaction problems. 

Genetically engineered transgenic plants have several advantages as 
protein sources compared with human or animal fluids/tissues, recombi- 
nant microbes, transfected animal cell lines or transgenic animals: 

- Plants are easy to grow and unlike bacteria or animal cells their cultiva- 
tion is straightforward and does not require special equipment or media 
or involve toxic chemicals. 

- They produce a large amount of biomass and protein production can be 
increased using plant suspension cell culture in fermenters (Fischer et 
al. 1999a), or by the propagation of stably transformed plant lines in the 
field. 

- The storage of genes and gene products in plants can be very stable. 
Transgenic plants can be self- fertilized to produce stable breeding lines, 
propagated by conventional horticultural techniques and distributed 
and stored as seeds. 
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Table 1. Expression of recombinant biopharmaceuticals in transgenic plants 


Protein 


Plant 

host 


Potential 
application 
or indication 


Expression 
levels 
(% of total 
soluble protein) 


Reference 


Human 
interferon a 


Rice 


Hepatitis C and 
B treatment 


Not reported 


Zhu et al. 
(1994) 


Human 

epidermal growth 
factor 


Tobacco 


Wound repair, 
control of cell 
proliferation 


0.001 


Higo et al. 
(1993) 


Glucocerebrosidase 


Tobacco 


Gaucher’s disease 


1-10 


Cramer et al. 
(1996) 


Human hemoglobin 
a and P 


Tobacco 


Blood substitute 


0.05 


Dieryck et al. 
(1997) 


Human serum 
albumin 


Potato 


Liver cirrhosis, 
burns, surgery 


0.02 


Sijmons et al. 
(1990) 


E.coli enterotoxin B 


Potato 


Immunogenicity 


0.003 


Haq et al. (1995) 


Hepatitis B surface 
antigen 


Tobacco 


Immunogenicity 


0.007 


Mason et al. 
(1992) 


Cholera toxin B 


Tobacco 


Immunogenicity 


Not reported 


Hein et al. (1996) 


oc-Tricosanthin 
from TMW-U1 


Tobacco 


HIV therapies 


4-5 


Kumagai et al. 
(1993) 


Human hirudin 


Canola 


Thrombin 

inhibitor 


0.3 


Parmenter et al. 
(1995) 



- Plant cells carry out all of the posttranslational modifications required 
for optimal biological activity of proteins. 



3 Plant Expression Strategies 

Genetic transformation of the plant genome can be performed using Agro- 
bacterium-mediated transformation (Horsch et al. 1985; Koncz and Schell 
1986), micro-projectile bombardment (Sanford et al. 1987; Klein et al. 1992; 
Christou 1993), viral vectors (Grill 1993; Porta and Lomonossoff 1996) or 
transformation of protoplasts. The transfected plants can express the re- 
combinant protein either constitutively or in a tissue-specific manner. 
Alternatively, genetically modified plant viruses can be used. Plants are 
infected with the modified virus and act as viral culture vessels. With this 
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method large quantities of recombinant proteins can be generated quite 
rapidly, using relatively small numbers of plants. The expressed recombi- 
nant protein can be targeted to stable environments within the plant. 
Alternatively, tissue-specific promoters can be used to direct expression in 
storage organs such as seeds or tubers. Extraction and purification from 
these sites are simple. 

Deposition into the extracellular apoplast may contribute to the stability 
of recombinant proteins by removing them from the more hydrolytic 
intracellular environment (Firek et al. 1993). 

Two strategies have been used to produce recombinant protein in plant 
hosts. One is based upon the generation of transgenic plants by stable 
integration of a transgene in the plant genome (Cheng et al. 1997; Muller et 
al. 2001). The other uses plants as hosts for transient expression by using 
plant viruses as vectors (Brooks and Bruening 1995; Scholthof et al. 1996; 
Chowrira et al. 1998). Generating stable transgenic plants is more labor-in- 
tensive than generating recombinant virus vectors that can infect wild-type 
plants. On the other hand, transgenic plants do not require inoculation and 
can be obtained in very large amounts once a stable strain has been gener- 
ated. 

Stable transgenic plants can be used to produce leaves or seeds rich in 
the recombinant protein for long-term storage or direct processing (Hiatt 
et al. 1989; Pen et al. 1993; Fiedler and Conrad 1995; Stoger et al. 2000). 

The choice of promoter, which determines the timing, tissue specificity, 
and level of transgene expression, is important for product yields of the 
transgene and recovery (Chinn and Comai 1996). The most widely used 
promoter in plant biology for overexpression of plant or recombinant 
proteins is the constitutive 35S promoter derived from the cauliflower 
mosaic virus (Benfey et al. 1989; Fang et al. 1989). However, constitutive 
expression of proteins has significant limitations. High constitutive expres- 
sion is sometimes associated with cosuppression or gene silencing (Taylor 
1997), resulting in little or no transgene product accumulation. For proteins 
that are not highly stable, constitutive expression can lead to synthesis-deg- 
radation cycles and contamination of the final product with degradation 
products. Use of inducible or tissue-specific promoters can avoid some of 
these limitations (Cramer and Weissenborn 1997). 



3.1 Selection of Crop Species 

In order to select a certain plant species as expression system it is important 
to take into account how readily it can be manipulated to produce a stable 
transgenic line, which tissue and subcellular compartment is suited for a 
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stable expression, processing and stability of the recombinant protein. 
Some factors, like the amenability to transformation, the ability to regen- 
erate whole plants and generation time, have a significant impact upon the 
time and resources required for product development (Lindsey 1996). 

Tobacco, for example, is often used to test plant-based systems for 
production of recombinant proteins, because it can easily be genetically 
engineered, produces much biomass and seed (1 million seeds per plant). 
Other plants tested are potato, tomato, alfalfa, soybeans, canola, rice or 
barley (Zhu et al. 1994; Haq et al. 1995; Dieryck et al. 1997). 



3.2 Choice of Tissue 

For obtaining a maximum yield, it is important that the selected plant 
species is able to concentrate biomass in the tissue or organ where the 
foreign protein is expressed. Among higher plants a variety of options are 
available including vegetative storage organs (e.g., tubers), leaves and seeds, 
which contain large quantities of storage proteins and have a low hydrolytic 
intracellular environment. Seeds have the advantage over most other or- 
gans that they can be stored for several years without any appreciable 
degradation of proteins or loss of activity (Pen et al. 1993). Therefore, 
immediate processing is not required. 

The chosen tissue should enable correct processing, stable accumula- 
tion, and efficient recovery. Mammalian antibodies, for example, have been 
produced in tobacco leaves following trafficking through the endoplasmic 
reticulum and Golgi complex (Ma et al. 1995). In tobacco leaves and some 
tissues of potato human serum albumin, a blood protein, has also been 
stably expressed (Sijmons et al. 1990), although a precise folding and 
functionality of the protein could not be established. Hirudin, an antico- 
agulant protein, has been produced in seeds of Brassica napus followed by 
targeting to protein and oil bodies (Parmenter et al. 1995). 



4 Localization and Processing of Expressed Proteins 

Many human therapeutic proteins of interest require complex posttransla- 
tional processing for their biological activity or appropriate targeting fol- 
lowing administration to patients. The protein processing steps are very 
similar between plants and animals. Most human proteins that have been 
expressed in plants show remarkable biochemical, structural and func- 
tional identities to proteins from humans or animal cell cultures. 
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4.1 Posttranslational Processing 

Posttranslational processing of proteins includes endomembrane target- 
ing, cleavage of signal peptides, protein folding and oligomerization, gly- 
cosylation, disulfide bond formation (Fischer et al. 2000). 

A remarkable fidelity with which plants recognize and correctly act upon 
most of the processing signals within mammalian polypeptides exists, 
which indicates a high degree of conservation in the protein processing 
machinery between plants and animals. 

However, clear variations in protein processing, most obvious in glyco- 
protein processing, exist between plants and animals. This leads to the 
generation of glycans that are heterogenous or differ significantly from the 
native conformation of particular human proteins (Jenkins et al. 1996). The 
differences in glycosylation patterns of plant antibodies have no effect on 
antigen binding or specificity (Ma and Vine 1999). 

Modifying the glycoprotein processing of plants through genetic engi- 
neering or in vitro enzymatic modification of the purified recombinant 
protein should enable commercialization of recombinant plant-derived 
glycoproteins for pharmaceutical uses (Chrispeels and Faye 1996). 



4.2 Plant Purification Techniques 

Plant-derived recombinant proteins can be purified by standard biochemi- 
cal techniques (Gailliot et al. 1990; Fischer et al. 1999b). Impurities in 
plant-derived biopharmaceuticals may include biologically active plant 
metabolites or alkaloids, plant macro-molecules (like DNA, polysaccha- 
rides or lipids), herbicides and pesticides. Such impurities could affect the 
safety and efficacy of the product, including direct toxic effects on the 
recipient or effects on product stability. The relatively mild aqueous extrac- 
tion procedures often used to solubilize recombinant proteins might not be 
very effective to extract many potential harmful substances, compared with 
the organic solvent treatments often employed in the extraction of other 
pharmaceuticals. 

One method for the purification of recombinant proteins is the use of 
affinity tags. After the creation of a fusion between the interesting protein 
and another protein that exhibits affinity for a specific ligand, the fusion 
protein can be recovered by binding to the ligand immobilized onto a 
support matrix (Cramer et al. 1996). 

For some proteins the affinity tag can be proteolytically removed from 
the fusion protein after purification. However, it is possible that the tag 
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could alter folding or processing of the recombinant protein. In addition 
this strategy is not suitable for any large-scale production. 

Another method used to simplify the purification of recombinant pro- 
teins is through compartmentalization. Using a signal peptide, the interest- 
ing protein can be targeted to a specific cellular location (Herbers et al. 
1995). Purification of the desired recombinant protein is then facilitated 
through the possibility of its physical separation from other proteins in the 
cell. Examples for compartmentalization are expression on viral particles, 
extracellular secretion, and targeting to intracellular storage organelles 
(Parmenter et al. 1995; Boothe et al. 1997). 

An attractive feature of secretion is that the signal peptide is removed 
from the recombinant protein during the normal processing without intro- 
ducing additional proteolytic digestion steps. 



5 Production of Immunogenic Proteins 
in Transgenic Plants 

A considerable effort has focused on genetic engineering for the improve- 
ment of plant characteristics and agricultural properties, but plant biotech- 
nology has also been applied to the production of “high-value” products, 
such as pharmaceutical compounds and vaccines. This approach is attrac- 
tive, because it has the potential for growing immunotherapeutic reagents 
on an agricultural scale, thereby reducing the costs of production signifi- 
cantly. An important advantage of plants as a recombinant expression 
system is the ability to assemble full-length heavy chains with light chains 
to form full-length antibodies efficiently (Hiatt et al. 1989). In bacterial 
expression systems full-length antibodies are not readily assembled. 

Originally, foreign antibody genes were introduced into plant cells by 
non-pathogenic strains of the natural plant pathogen Agrobacterium tume- 
faciens (Horsch et al. 1985) and regeneration in tissue culture resulted in 
the recovery of stable transgenic plants. Although the initial work to gen- 
erate multichain proteins required crossing of plants expressing each chain, 
more recent studies have shown that multiple chains can be introduced via 
a single biolistic transformation event (Sanford 1988), greatly reducing the 
time to final assembly antibody. 

The use of edible plants for the production and delivery of vaccines could 
provide an economical alternative to fermentation systems. Genes encod- 
ing bacterial and viral antigens are expressed in edible tissues to form 
immunogenic proteins. Studies in humans and animals demonstrated the 
production of antigen-specific antibodies in serum and mucosal secretions 
after ingestion of transgenic plants containing vaccine proteins. 
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5.1 Antibody Targeting 

In general, the technology is limited by low expression levels for nuclear- 
integrated transgenes, but recent progress in plant organelle transforma- 
tion shows promise for enhanced expression. Transgenes can be fusioned 
with signal peptide sequences to enable entry of proteins into the secretory 
pathway (Hiatt et al. 1989; Bednarek and Raikhel 1992). In the secretory 
system complete antibodies and Fab fragments can be assembled. Single- 
chain (sc) Fv constructs can be functionally expressed in the cytoplasm 
without any signal sequence (Owen et al. 1992; Tavladoraki et al. 1993). It 
is possible to target them to different compartments through fusion of 
targeting signals to either terminus of the scFv construct (Chrispeels 1991). 



5.2 Optimization of Antibody Production 

In 1989, the production of antibody in transgenic plants was first described. 
It was demonstrated, that co-expression of two recombinant gene products 
could lead to a correctly folded and assembled multimeric molecule in 
plants that was functionally identical to its mammalian counterpart (Hiatt 
et al. 1989). Originally, only low expression levels were achieved in plants. 
Various strategies have been devised to improve yields, e.g. the expression 
of genes for antibody fragments in fusion with the endoplasmic-reticulum 
retention signal KDEL. This strategy can increase scFv yield tenfold 
(Schouten et al. 1996) or up to 4-6.8% of total soluble protein (Fiedler et al. 
1997). 

Another successful approach is the targeting of scFv for expression in 
seeds, where the antibody fragment can accumulate to 3-4% of the total 
soluble seed protein (Fiedler et al. 1997). 

Recombinant antibodies produced in tobacco plants have the same 
sensitivity, specificity and, importantly, the same affinity as monoclonal 
antibodies produced by the original hybridoma cell line (Hein 1996). 



5.3 Developments in Plant-Derived Vaccines 

Vaccines are aimed to prime the immune system to destroy specific dis- 
ease-causing pathogens or agents, before they can multiply enough to cause 
symptoms. This has classically been achieved by presenting whole bacteria 
or viruses to the immune system that have been attenuated or killed to 
prevent their multiplication in the vaccinated person. 
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The first reported vaccine candidate to be expressed in transgenic plants 
was the cell surface-adhesion protein Spa A or streptococcal antigen I/II 
from Streptococcus mutans (Curtiss and Cardineau 1990). This is the major 
bacterial cause of tooth decay, and there is a considerable evidence that 
demonstrates protective immunity against dental caries following systemic 
or oral immunization with this antigen. The spa A gene was introduced into 
tobacco by agrobacterium-mediated transformation, and the protein was 
expressed at levels of up to 0.02% total leaf protein. 

Hepatitis B surface antigen (HbsAg) has been expressed in transgenic 
potatoes, in addition to a chimeric gene encoding the M protein of hepatitis 
B virus. For oral vaccination with edible, transgenic plant material, potato 
may not be ideal, because the cooking process might denature the recom- 
binant protein. Alternatively tomato was used for expression of the rabies 
virus glycoprotein (McGarvey et al. 1995). Using the complete unmodified 
gene under the control of the 35S promoter of cauliflower mosaic virus, 
expression of glycoprotein was found in leaf and fruit tissue. The tomato- 
expressed glycoprotein was recognized by specific antisera and a mono- 
clonal antibody, suggesting preservation of important epitopes. 

However, a persistent problem in the expression of vaccines in trans- 
genic plants has been the low levels of expression, although several ap- 
proaches are being adopted to overcome this technical problem. The excep- 
tion is antibodies for which levels of 1-5% total protein are achieved 
consistently (Mason and Arntzen 1995; Daniell et al. 2001). 



6 Regulatory and Safety Issues 
6.1 General Concerns 

The use of plants avoids many of the potential safety issues associated with 
contaminating mammalian viruses, as well as ethical concerns involving 
the use of animals. An important benefit of plants involves product safety. 
Plants are free from known human pathogens like HIV and also from 
unknown or poorly characterized agents as the prions that are responsible 
for bovine spongiform encephalopathy and the related Creutzfeld- Jakob 
disease (Roberts and James 1996; Vaughan 1996). Plant viruses, used in 
virus-based transient expression systems, are not known to infect humans 
or animals. 




188 



Genetics 



However, there are also some unresolved issues: 

- How to easily identify seed or fruits from transgenic plants expressing 
therapeutic proteins? This is important as these are not acceptable in the 
food chain. 

- Another issue is regarding the amount of antibodies necessary for an oral 
vaccination, as it is difficult to guarantee a certain level of antigen 
protein in a specific plant tissue. 



6.2 Excision of Selectable Marker Genes from Transgenic Plants 

Selectable marker genes are required to ensure the efficient genetic modi- 
fication of crops. Safety considerations and economic incentives have 
prompted the development of several strategies (homologous recombina- 
tion, site-specific recombination, and transposition) to eliminate these 
genes from the genome after they have fulfilled their purpose (Dale and Ow 
1991; Fig. 1). Recently, chemically inducible site-specific recombinase sys- 
tems have emerged as valuable tools for an efficient regulation of the 
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Fig. 1 A-C. Excision of marker genes. Use of l^ie Cre/lox system to remove marker genes from 
transgenic plants. A The marker gene (kan ) and the CreA gene (shown as rectangles) are 
localized between two lox sequences (shown as triangles). The CreA gene is driven by an 
inducible promoter (iP). B Upon induction of the promoter, the CREA protein specifically 
recognizes the lox DNA sequences and the sequences between will be excised; C a single lox 
sequence remains 
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excision of transgenes when their expression is no longer required (Odell 
et al. 1994; Hajdukiewicz et al. 2001; Zuo et al. 2001). The implementation 
of these strategies in crops and the further improvement will help to 
expedite widespread public acceptance of agricultural biotechnology. 



7 Outlook 

Plant biotechnology is a rapidly expanding area. Its further development 
will bring many therapies that are now too expensive for wide use into the 
hands of most medical practitioners. One of the most obvious advantages 
of plants is the potential for scale-up production, in which large amounts 
of recombinant protein could be grown at minimal cost. However, some 
issues have not yet been properly addressed, i.e. how to comply with the 
requirements of “good manufacturing practice” (GMP), which is required 
for all human therapeutic proteins. 

Advancing technology, such as oral DNA vaccines or edible plant-de- 
rived vaccines, may lead to a future of safer and more effective immuniza- 
tion. It seems likely that sufficient antibody could be “grown” in plants, for 
most medical applications, on only a few acres of land and at minimal cost 
relative to alternative production methods. 

It is anticipated that the advances in the genetic engineering of plants 
and the demand for large quantities of new or improved therapeutics and 
diagnostics in the pharmaceutical markets will lead to a rapid expansion in 
this field. 
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Plant Breeding: 

The ABCs of Flower Development in Arabidopsis 
and Rice 

Gunter TheiCen and Annette Becker 



1 Introduction 

1.1 Functional Phylogenomics of Flower Development 

Understanding the genetic basis of flower formation has been pioneered in 
higher eudicotyledonous flowering plants such as thale cress ( Arabidopsis 
thaliana ), snapdragon (Antirrhinum majus) and petunia ( Petunia hybrida) 
(for a review, see TheiCen and Saedler 1999). A deeper understanding of 
flower origin and diversification, however, requires studies in phylogenetic 
informative taxa outside the higher eudicots, including basal eudicots, 
monocots and basal angiosperms (Soltis et al. 2002). Such studies will 
comprise ‘phylogenomic’ approaches on gene families of interest (such as 
the MADS-box gene family), comparative EST sequencing and expression 
studies in a huge number of taxa (Soltis et al. 2002). These efforts, however, 
will allow only quite limited conclusions on the phylogeny of gene functions 
and thus should be complemented by comparative developmental genetics 
studies (Baum et al. 2002), implying that suitable model systems have to be 
established for different key non-eudicotyledonous flowering plant taxa. 
This will be a laborious effort, since very few plant species are amenable to 
the required technology (such as transformation). 

Fortunately, nature and a strong human interest in agronomical impor- 
tant plants have provided us already with an almost ideal model system for 
the monocots: rice ( Oryza sativa). Studies on flower development in rice 
are strongly catching up with those in Arabidopsis , so that comprehensive 
comparisons between both systems can start now, as exemplified below for 
the floral organ identity genes. 



1.2 Arabidopsis and Rice: The Two Most Important Model Plants 

For quite some time the tiny weed Arabidopsis thaliana (in brief, Arabidop- 
sis) is serving now as the major model system for understanding gene 
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functions in flowering plants. This is so due to a number of unrivaled 
advantages such as small genome size, short life cycle, low requirements for 
growth space, and easy transformability. Based on these features, a number 
of useful tools have been developed, such as transposon and T-DNA tagged 
populations for reverse genetics screens, and the genome has been almost 
completely sequenced (Arabidopsis Genome Initiative 2000). Therefore, 
Arabidopsis will certainly play its role as the primary plant model system 
also for quite some years to come. 

Meanwhile, however, the genome of a second flowering plant, rice 
( Oryza sativa) has also been almost completely sequenced (Feng et al. 2002; 
Goff et al. 2002; Sasaki et al. 2002; Yu et al. 2002). Although rice has a genome 
size which is roughly about four times larger than that of Arabidopsis and 
has a longer life cycle, it has features which still qualify it as a reasonably 
good model system, and powerful tools for functional genomics are mean- 
while also available for that species (Shimamoto and Kyozuka 2002). 

Employing rice as a second major plant model system for genetic and 
functional genomics studies appears very much appropriate for a number 
of reasons. 

- In sharp contrast to Arabidopsis , rice is of enormous agronomic impor- 
tance. In fact, rice, with its two subspecies Oryza sativa ssp. indica and 
Oryza sativa ssp.japonica , is the major staple food for half of the world’s 
population and belongs to the three most important crop plants on a 
global scale (besides wheat and maize). Moreover, rice is relatively 
closely related to all the other agronomically important cereal grasses, 
including not only wheat ( Triticum aestivum) and maize (Zea mays ssp. 
mays), but also barley ( Hordeum vulgare ), rye ( Secale cereale ), oat 
(Avena sativa ), and sorghum ( Sorghum bicolor). The rice genome is 
largely colinear with those of all of these grasses, so that it is considered 
as the reference genome for the world’s most important crop plants 
(Shimamoto and Kyozuka 2002). A transfer of knowledge from Arabi- 
dopsis to rice, and a detailed study of rice genetics and physiology may 
thus be very helpful for future plant breeding, including a future design 
of cereal crops by transgenic technology, or mutagenesis and marker 
assisted breeding (Meyerowitz 1994; Theifien 2000, 2001c, 2002a). 

- The lineages that led to Arabidopsis (a eudicot) and rice (a monocot) 
separated quite early during flowering plant evolution, i.e., according to 
molecular data, about 200 million years ago (TheiBen et al. 2000, and 
references cited therein). Thus a comparison between the functions of 
orthologous genes in rice and Arabidopsis promises a fairly good im- 
pression about both the conservation and divergence of gene functions 
throughout most of flowering plants. 
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- Rice has some physiological and morphological features which are quite 
different from those of Arabidopsis. For example, rice is a short-day 
plant rather than a facultative long-day plant like Arabidopsis , and it has 
a highly derived floral morphology. It is difficult, therefore, to generalize 
findings from Arabidopsis for the flowering plants in general (although 
this is often done, explicitly or implicitly) without looking at lineages 
that are so deviant like rice, representing the grasses (Poaceae). 



1.3 The Flowers of Arabidopsis and Rice 

The flowers of Arabidopsis and rice appear quite different. While for the 
reproductive organs, i.e., stamens and carpels, homology between Arabi- 
dopsis and rice is unquestionable, it is still quite controversial for other 
organs. An Arabidopsis flower is composed of (from inner to outer whorls) 
two fused carpels, being the female reproductive organs that constitute the 
gynoecium; six stamens, being the male reproductive organs; four white, 
showy petals; and four green, leaf-like sepals. 

The tiny, wind-pollinated flowers of the grasses (Poaceae) are quite 
different from the flowers within other flowering plant families. They have 
carpels and stamens like their eudicot relatives, but they lack typical petals 
and sepals. Instead, lodicules, palea and lemma surround the carpels and 
stamens, thus constituting structures called florets. One or several florets 
are enclosed by typically two glumes, thus constituting structures called 
spikelets. Lodicules are small, glandular-like organs that swell at anthesis 
to spread the lemma and palea apart so that the wind can disperse the pollen 
produced by the stamens. According to their position in the flower, 
lodicules might be petal homologues. Palea, lemma and glumes are leaf-like 
organs. Palea and lemma could be homologues of eudioct sepals; alterna- 
tively, the palea may represent a prophyll, and the lemma a true bract 
(Clifford 1987). 

In case of rice there is just a single carpel in the center of the flower; it is 
surrounded by six stamens, two lodicules, a palea and a lemma, and an 
upper and a lower sterile glume (Shimamoto and Kyozuka 2002). 



1.4 The Developmental Genetic Context of Floral Organ Formation 

Right from the beginning, flower development is under strict genetic con- 
trol. This guarantees that flower development is initiated only under con- 
ditions which are favorable for reproduction, but that it, once it started, 
proceeds in a highly standardized way. Flower development can be subdi- 
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vided into several major steps, such as floral induction, floral meristem 
formation, and floral organ development. Genetic control of the different 
steps of flower development is achieved by a hierarchy of interacting 
regulatory genes, most of which encode transcription factors (for a brief 
review, see Theifien 2001a). Near the top of that hierarchy are ‘flowering 
time genes’ which are triggered by developmental cues and environmental 
factors such as plant age, day length and temperature. ‘Flowering time 
genes’ mediate the switch from vegetative to reproductive development by 
activating ‘meristem identity genes’. ‘Meristem identity genes’ control the 
transition from vegetative to inflorescence and floral meristems. When the 
transition from inflorescence to floral meristems has taken place, floral 
organs arise at defined positions from within these meristems under the 
control of different types of genes. ‘Floral meristem size genes’ regulate the 
size of the floral meristem and also influence floral organ number. ‘Cadas- 
tral genes’ are involved in setting the boundaries of floral organ identity 
gene functions. ‘Floral organ pattern genes’ act to establish floral organ 
primordia in specific numbers and positions. These primordia develop into 
the different types of floral organs under the control of specific homeotic 
selector genes, termed ‘floral organ identity genes’. 

The function of ‘floral organ identity genes’ was recognized during the 
study of homeotic mutants in which the identity of floral organs is changed. 
In Arabidopsis such mutants come in three classes, A, B and C. Ideal class 
A mutants have carpels in the first whorl instead of sepals, and stamens in 
the second whorl instead of petals. Class B mutants have sepals rather than 
petals in the second whorl, and carpels rather than stamens in the third 
whorl. Class C mutants have petals instead of stamens in the third whorl, 
and replacement of the carpels in the fourth whorl by sepals. In addition, 
these mutants are indeterminate, i.e. there is continued production of 
mutant floral organs inside the fourth whorl. 

Based on these classes of mutants the ‘ABC model’ proposes three classes 
of combinatorially acting ‘floral organ identity genes’, called A, B and C, 
with A specifying sepals in the first floral whorl, A+B petals in the second 
whorl, B+C stamens in the third whorl, and C carpels in the fourth whorl. 
The model also maintains that the class A and class C genes negatively 
regulate each other. Based on studies in petunia ( Petunia hybrida ), the ‘ABC 
model’ was later extended by class D genes, specifying ovules. Meanwhile, 
it has been demonstrated by a reverse genetic approach that yet another 
class of ‘floral organ identity genes’, termed class E genes, is involved in 
specifying petals, stamens and carpels. The ‘floral organ identity genes’ can 
be interpreted as acting as major developmental switches that activate the 
entire genetic program for a particular organ (for a description of the 
ABCDE model, see TheiCen 2001b). 
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In Arabidopsis , class A genes comprise APETALA1 (API) and 
APETALA2 ( AP2 ). The class B genes are represented by APETALA3 ( AP3 ) 
and PISTILLATA (PI), and the class C gene is AGAMOUS (AG). Class D 
genes have been recognized only in petunia so far, where they have been 
termed FLORAL BINDING PROTEIN7 (FBP7) and FBP11 , but there is a 
putative ortholog in Arabidopsis termed AGL11. The class E genes in 
Arabidopsis comprise SEPALLATA1 (SEP1), SEP2 and SEP 3, which have 
highly redundant functions. All these genes encode putative transcription 
factors. Thus the products of the ‘floral organ identity genes’ probably all 
control the transcription of other genes (largely unknown ‘target genes’) 
whose products are involved in the formation or function of the different 
floral organs. Except for AP2 , all ‘floral organ identity genes’ are MADS-box 
genes, encoding MADS-domain transcription factors (TheiBen 2001b). 

Phylogeny reconstructions revealed that the MADS-box gene family is 
composed of several defined gene clades (Doyle 1994; Purugganan et al. 
1995; Theifien et al. 1996, 2000). All class A, B, C, D and E genes identified 
from eudicots fall into separate clades, namely SQUAMOSA- (class A), 
DEFICIENS- or GLOBOSA- (class B), AGAMOUS- (class C and D ) and 
AGL2-like genes (class E) (Doyle 1994; Purugganan et al. 1995; Theifien and 
Saedler 1995; Angenent and Colombo 1996; Theifien et al. 1996, 2000; 
Munster et al. 1997). If the establishment of the mentioned gene clades by 
gene duplication and the structural and functional diversification that 
established the different gene functions predated the radiation of crown 
angiosperms, class A, B, C, D and E genes from non-eudicots should also 
be found in these gene clades. This can be tested now in non-eudicot model 
systems such as rice. Such tests are not always straight forward, however, 
because of the unclear homology between some organs in the flowers of 
Arabidopsis and rice. 



2 Floral Organ Identity Genes in Arabidopsis and Rice 
2.1 SQUA-Like Genes (Class A Candidate Genes) 

There are three SQUA-like genes in the Arabidopsis genome, termed 
APETALA1 (API), CAULIFLOWER (CAL) and AGL8 (Table 1; Fig. 1) (Man- 
del et al. 1992; Kempin et al. 1995; Gu et al. 1998). Only API is involved in 
specifying sepal and petal identity and thus, by definition, works as a class 
A floral organ identity gene (Mandel et al. 1992). In fact, API from Arabi- 
dopsis is the only MADS-box gene so far where a class A gene function can 
be genetically separated from other functions, such as specifying floral 
meristem identity. 
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Plants homozygous for mutations in CAL are phenotypically wild type, but together with 
mutations in API , the typical ‘cauliflower' phenotype occurs, which is characterized by an 
extensive proliferation of meristems at each position that in wild type would give rise to a 
single flower (Kempin et al. 1995). These data indicate that CAL and API encode partially 
redundant functions in the specification of floral meristem identity. According to its single 
mutant phenotype AGL8 is required for the normal pattern of cell division, expansion, and 
differentiation during morphogenesis of the silique, and was therefore renamed FR UITFULL 
( FUL ) (Gu et al. 1998). The major part of the silique is provided by the carpel valves, and it 
appears that FUL is a valve identity gene (Gu et al. 1998). Although /w/ single mutants do 
not have an inflorescence-specific phenotype, it is also one of the first genes expressed after 
the transition to flowering at the inflorescence apex. In line with this, apical ful triple 
mutants show an extreme enhancement of the apl cal phenotype, indicating partial func- 
tional redundancy between the API, CAL and FUL genes during flower initiation (Ferrandiz 
et al. 2000). 

Phylogeny reconstructions suggest that the ancestral function of SQ t/A-like 
genes was in specifying inflorescence or floral meristem identity, which is 
maintained in many extant genes, and that functions in specifying sepal and 
petal, or fruit valve identity are probably derived (Theifien 2000; TheiCen 
et al. 2000). Therefore, one may expect SQLM-type floral meristem identity 
genes in rice, but not necessarily class A gene functional equivalents. 

Three different SQUA -like genes have been identified in the rice genome 
so far, termed OsMADS14 , OsMADS15 and OsMADS18 (Table 1, Fig. 1). A 
number of other sequences published under other names probably repre- 
sent the same genes; a few differences in sequences, if any, may reflect 
sequencing errors or allelic diversity, especially since sequences have often 
been obtained from different rice subspecies ( japonica and indica ). 

OsMADSM (Moon et al. 1999b) represents very likely the same genes as FDRMADS6 (Jia et 
al. 2000) and RAP1B (Kyozuka et al. 2000). OsMADSM expression is confined to inflores- 
cences. Expression occurs throughout the spikelet meristem. After organ primordia have 
been generated, OsMADSM expression becomes restricted to the sterile glumes, palea and 
lemma. Surprisingly, in the mature spikelet, OsMADSM expression is shut off in the sterile 
organs of the spikelet, and switches to the reproductive organs, i.e., stamens and carpels (Jia 
et al. 2000; Pelucchi et al. 2002). 

OsMADSM (Lim et al. 2000) is very likely the same gene as RAP1A (Kyozuka et al. 2000). 
OsMADSM is expressed throughout the spikelet meristem. After initiation of the different 
organs of the spikelet transcript accumulation becomes restricted to the non-reproductive 
organs, i.e., lodicules, palea, lemma, and empty glumes, with only a weak expression in the 
latter organs (Kyozuka et al. 2000). SQt/A-like genes with a similar expression pattern have 
also been found in maize (Munster et al. 2002). 

OsMADSM (Moon et al. 1999b) is probably allelic with FDRMADS7 (Jia et al. 2000) and 
OsMADS28. In contrast to the other rice SQ l/A-like genes, it has a quite ubiquitous expres- 
sion in many rice organs and tissues. Within spikelets, OsMADSM is expressed in all organs 
except for very late stages, where transcript accumulation is not detectable in lodicules and 
sterile glumes (Fornara et al. 2003). 

The question arises as to whether rice has a class A floral homeotic gene as 
defined by function (i.e., independent of gene relationships to class A genes 
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from eudicots). Since grasses like rice have no sepals and petals, they should 
not be expected to have class A floral organ identity genes in a strict sense. 
And even in eudicots a function in the specification of perianth organ 
identity is hard to separate from evolutionary probably more ancient 
functions in providing meristem identity (TheiCen et al. 2000). Neverthe- 
less, if we assume that palea and lemma are homologues of eudicot sepals, 
and lodicules are petal homologues, some SQUA - like gene might be in- 
volved in the specification of these organs in rice. Unfortunately, loss-of- 
function mutants for any of the rice SQUA-like genes have not been re- 
ported yet. If we use the mRNA expression patterns as criteria, OsMADS15 
might be the best candidate for a class A-like gene involved in specifying 
the identity of lodicules, palea and lemma. 



2.2 DEF-Like and GIO-Like Genes (Class B Candidate Genes) 

In higher eudicots, including Arabidopsis , class B floral organ identity genes 
are involved in specifying the identity of petals and stamens. Class B gene 
loss-of-function mutants have petals replaced by sepals, and stamens by 
carpels. Studies on extant gymnosperms revealed that class B genes may 
have an ancestral function in specifying male reproductive organs (where 
B gene expression is on) and distinguishing them from female reproductive 
organs (B gene expression off) that was probably established already in the 
last common ancestor of extant seed plants about 300 million years ago 
(Albert et al. 1998; Mouradov et al. 1999; Sundstrom et al. 1999; Winter et 
al. 1999, 2002a; Fukui et al. 2001). After the establishment of distinct DEF- 
and GLO-like genes by gene duplication during angiosperm evolution, they 
were recruited also for specifying petal identity, and their gene products 
evolved from homodimerizing into obligatory heterodimerizing proteins 
(Winter et al. 2002b). 

There is just one DEF-like and one GLO-like gene in the Arabidopsis 
genome, termed APETALA3 ( AP3 ) and PISTILLATA (PI), respectively (Ta- 
ble 1, Fig. 1; Jack et al. 1992; Goto and Meyerowitz 1994). Two GLO-like 
genes, termed OsMADS2 and OsMADS4 , and one DEF-like gene, Os- 
MADS16 , have been reported so far from rice (Chung et al. 1995; Moon et 
al. 1999a). 

The presence of a paralogous pair of GLO-like genes is not a speciality of rice, but traces 
back to a gene duplication that predates the separation of the lineages that led to extant 
maize and rice 50-70 million years ago (Munster et al. 2002). OsMADS2 , OsMADS4 and 
OsMADS16 are expressed in lodicules and stamens. Whether OsMADS2 and OsMADS4 are 
also expressed in carpels is controversial (compare results from Chung et al. 1995, with 
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Kyozuka et al. 2000), but mRNA accumulation in carpels has also been observed for the 
GLO-like genes of maize (Munster et al. 2001). 

Transgenic plants expressing antisense RNA of OSMADS4 exhibited homeotic transfor- 
mations of lodicules into palea/lemma-like structures and stamens into carpel-like organs 
(Kang et al. 1998). A very similar phenotype was observed in mutants termed superwomanl 
( spwl ), which are affected in OsMADS16 (Nagasawa et al. 2003), and in the silky 1 mutant, 
in which the DEF - like gene from maize has lost its function (Ambrose et al. 2000). These 
data indicate that the DEF- and GLO-like genes from grasses are involved in specifying 
stamen and lodicule identity and thus suggest that considerable aspects of the DEF- and 
GLO-like gene function have been conserved throughout monocot and eudicot evolution. 
This is especially obvious for the function in the specification of male reproductive organs 
(stamens), which was very likely established in the most recent common ancestor of 
angiosperms and gymnosperms already (Theifien et al. 2000; Winter et al. 2002a); there is 
hardly any doubt that stamens are homologous organs throughout angiosperms. 

Since in eudicot class B gene mutants petals are replaced by sepals, the 
loss-of-function phenotypes of DEF- and GLO-like genes in rice and maize 
have been used as evidence that lodicules are homologous to petals, and 
palea/lemma are homologous to sepals (Kang et al. 1998; Ambrose et al. 
2000). While this is certainly a likely possibility, things are unfortunately 
not necessarily so straightforward. It could also be that orthologous DEF- 
and GLO-like genes have been recruited more than once for the specifica- 
tion of organs that originated independently in evolution and are thus not 
homologous. There is evidence that eudicot-type petals originated from 
stamens at the base of extant higher eudiots (and are hence termed ‘an- 
dropetals’) (Kramer and Irish 2000); thus, lodicules could have been de- 
rived from stamens independently in the lineage that led to extant mono- 
cots. This would also nicely explain why DEF- and GLO-like genes are 
involved in the specification of both kinds of organs. 

Taken together, it is quite clear now that grasses have orthologs of both 
types of eudicot class B genes (DEF- and GLO-like genes), which share 
highly conserved functions in the specification of male reproductive organs 
with eudicot DEF- and GLO-like genes. Like their eudicot counterparts, 
being involved in the specification of petals, these genes are also involved 
in specifying the identity of the organs immediately outside of the stamens, 
i.e., the lodicules. Whether lodicules are homologous to petals, however, is 
not finally clear. The DEF - and GLO-like genes of grasses, including rice, 
have thus certainly a class B-like function, but the extent to which this 
function is related to that of eudicot class B genes is not fully understood. 



2.3 AG-Like Genes (Class C and Class D Candidate Genes) 

There are four different AGAMOUS- ( AG- ) like genes in the Arabidopsis 
genome, AG, AGL11 , SHATTERPROOF 1 ( SHP1 ) and SHP2 (Table 1, Fig. 1) 
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(Yanofsky et al. 1990; Ma et al. 1991; Rounsley et al., 1995). AG is a typical 
class C floral homeotic gene; it is involved in specifying stamen and carpel 
identity, and in providing floral determinacy. SHP1 and SHP2 (formerly 
known as A GL1 and AGL5 , respectively) encode functionally redundant 
proteins that are required for the proper development of the fruit dehis- 
cence zone of Arabidopsis (Liljegren et al. 2000). AGL11 appears to be 
orthologous to the class D floral homeotic genes FBP7 and FBP1 1 from 
petunia ( Petunia hybrida ), which are master control genes of ovule identity 
(Angenent and Colombo 1996). In line with this, these genes are mainly 
expressed in developing ovules, which is also the case for AGL11. Thus 
AGL11 may also be involved in the specification of ovule identity. 

Two AG-like genes have been found in the rice genome, termed Os- 
MADS3 (Kang et al. 1995) and OsMADS13 (Lopez-Dee et al. 1999). Os- 
MADS3 is very likely allelic to RAG1 (Kyozuka et al. 2000). OsMADS3 is 
expressed in both the stamens and carpels of rice (Kang et al. 1995; Kyozuka 
et al. 2000), while OsMADS13 transcription is confined to developing ovules 
and the inner layer of the carpel wall (Lopez-Dee et al. 1999). These data 
suggest that OsMADS3 represents a class C floral organ identity gene, while 
OsMADS13 is a class D gene. Further studies enforce a more differentiated 
view, however. 

In rice plants in which OsMADS3 expression was silenced by an antisense approach, 
incomplete transformations of stamens into lodicules were observed, while carpels were 
replaced by abnormal flowers with undifferentiated stamens and carpels (Kang et al. 1998). 
These observations are compatible with a class C function of OsMADS3 , assuming that there 
is functional redundancy in the rice genome, or, more likely, that OsMADS3 gene function 
was not completely abolished in the antisense lines. However, Kang et al. (1998) observed 
also that lodicules were transformed into several types of unusual organs, including stamen- 
like structures, in the OsMADS3 antisense lines. This may suggest that OsMADS3 is also 
required for proper lodicule development, which would go beyond the classical class C floral 
homeotic function in stamens and carpels. Ectopic expression of OsMADS3 in rice, but also 
in tobacco leads to the transformation of lodicules or petals, respectively, into stamen-like 
organs (Kang et al. 1995; Kyozuka and Shimamoto 2002). Thus, gain-of-function studies 
corroborate the view based on loss-of- function analyses that OsMADS3 specifies reproduc- 
tive organ identity. 

Despite its expression in rice wild-type ovules, ectopic expression of Os- 
MADS13 in rice and Arabidopsis did not generate ectopic ovules; however, 
the same is also true for FBP7 and FBP1 1 in Arabidopsis (Favaro et al. 2002). 
Like the FBP7 and FBP11 proteins, however, OsMADS13 also interacts 
specifically with AGL2-like MADS-domain proteins, as revealed by yeast 
two-hybrid analyses, suggesting that FBP7/FBP11 and OsMADS13 share a 
conserved function. In line with its expression pattern, OsMADS13 may 
thus also have a class D function in the specification of ovules. 




202 



Genetics 



Phylogeny reconstructions (G. Theifien and A. Becker, unpubl.) suggest, however, that this 
class D function in monocots was not directly derived from the class D gene function 
represented in extant eudicots. Members of the clade of AG-like genes have been found in 
all major groups of seed plants (angiosperms, gnetophytes, conifers, cycads and Ginkgo ), 
but not in nonseed plants so far (Theifien et al. 2000 and unpubl. data), suggesting that this 
clade was established 300-400 million years ago (Theifien et al. 2000). In diverse groups of 
gymnosperms AG-like genes are expressed in male and female reproductive organs (Tandre 
et al. 1995, 1998; Rutledge et al. 1998; Winter et al. 1999), which may thus represent the 
ancestral state of gene expression. It has been suggested that the ancestral function of these 
genes may be to specify reproductive organs (both male and female), and to distinguish 
them from nonreproductive organs (Haughn and Somerville 1988; Winter et al. 1999). A 
gene duplication close to the base of angiosperms may have led to the establishment of two 
clades: ( 1 ) a clade of ovule-specific genes, represented by the class D genes of extant eudicots, 
probably including AGL11 from Arabidopsis ; (2) and a clade of genes which remained 
expressed in both male and female reproductive organs. Members of this latter clade were 
recruited for specification of the carpel, and thus gave rise to the class C floral homeotic 
genes. According to phylogeny reconstructions, it appears likely that the ancestral ovule- 
specific genes have been lost in the lineage that led to monocots after the separation from 
the lineage that led to extant eudicots about 160-200 million years ago. Ovule- or carpel- 
specific angiosperm genes (monocot class D genes) may have originated a second time by 
splitting from the class C gene lineage close to the base of monocots. Thus, although there 
are ovule- or carpel-specific AG-like genes in both eudicots (e.g., FBP7) and monocots 
( OsMADS13 , ZAG2 , ZMM1) (Schmidt et al. 1993; Theifien et al. 1995; Lopez-Dee et al. 1999), 
phylogeny reconstructions suggest that they are not monophyletic, but that such genes have 
been lost and reestablished in the lineage that led to extant monocots. However, because of 
uncertainties of previous phylogeny reconstructions, monophyly of class D genes cannot be 
completely ruled out. 

Taking together, some AG-like genes in angiosperms, such as AG, have 
probably an ancestral function in specifying both male and female repro- 
ductive organs, and some others, including AGL11 , SHP1 and SHP2 , have 
derived or specialized functions restricted to female reproductive organs 
(including ovules and fruits) (Theifien 2000). Similarly, OsMADS3 function 
may resemble more closely the broad ancestral function in the specification 
of both male and female reproductive organs; after a gene duplication 
subfunctionalization may have led to a ‘reinvention’ of an ovule-specific 
gene ( OsMADS13 ), while the gene lineage that led to OsMADS3 maintained 
its functions in carpels and stamens. The function of class D genes in ovules 
could also well be quite ancient (since gymnosperms have ovules, but not 
carpels), but at gymnosperm level it was not and is not separated from an 
additional function of AG-like genes in male reproductive organs. Thus the 
grasses, including rice, may have class C genes which are orthologous to 
those of the eudicots, while they may have genes with class D functions in 
ovule development that are members of the same gene subfamily, but which 
are possibly not orthologous to their putative eudicot functional equiva- 
lents (Table 1; Fig. 1). 
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2.4 AGL2 - Like Genes (Class E Candidate Genes) 

There are four AGL2-like genes in the Arabidopsis genome, termed AGL2, 
AGL3, AGL4 and AGL9 (Table 1, Fig. 1; Ma et al. 1991; Mandel et al. 1998). 
While AGL3 is expressed in all major plant organs above ground (Huang et 
al. 1995), expression of the other genes is restricted to second, third and 
fourth whorl floral organ primordia ( AGL9 ), or to floral organs of all four 
whorls (AGL2 and AGL4) (Flanagan and Ma 1994; Savidge et al. 1995; Mandel 
and Yanofsky 1998). The similarity in both sequences and expression pat- 
terns between the AGL2-like genes from Arabidopsis (except AGL3) sug- 
gested that they encode proteins of redundant functions, and indeed, single 
gene mutations generated by reverse genetics produced only very weak 
phenotypes (Pelaz et al. 2000). However, in triple mutants where AGL2, 
AGL4 as well as AGL9 had lost their function, all flower organs resembled 
sepals, and the flowers became indeterminate (Pelaz et al. 2000). These 
findings indicate that A GL2/4/9 have redundant functions required for petal, 
stamen and carpel development, and to prevent the indeterminate growth 
of the flower meristem. The triple mutant phenotype prompted Pelaz et al. 
(2000) to rename the genes SEPALLATA1 (S£PI), SEP2 and SEP3 , respec- 
tively, meaning Tots of sepals’. The sepl/213 triple mutant phenotype is very 
similar to that of double mutants in a class B and a C class floral homeotic 
gene. However, SEP1I2I3 are still expressed in B and C loss-of-function 
mutants, and the initial expression patterns of B and C class genes are not 
significantly altered in the sep 1/2/3 triple mutant. This indicates that SEP 1/2/3 
do not act downstream of the class B and class C genes, and that they are not 
required for the initial activation of these genes (Pelaz et al. 2000). 

These data suggest that the SEP genes constitute an additional class of 
floral homeotic genes, termed class E genes (Theifien 2001b). The SEP 
proteins constitute higher order complexes together with class A, B or C 
proteins, and certain versions of these complexes are obviously sufficient 
to transform leaves into floral organs; specifically, ectopic expression of 
class A or class E ( SQUA - or AGL2-like) and class B ( DEE - and GLO-like) 
genes results in the transformation of vegetative leaves into petals, while 
the expression of class B, class C (AG-like) and class E genes results in the 
transformation of leaves into stamen-like organs (Honma and Goto 2001). 
Therefore, the gene-based combinatorial ABC model of floral organ iden- 
tity was refined to a protein-based combinatorial ‘floral quartet model’ 
(TheiBen 2001b; Theifien and Saedler 2001). The ‘floral quartet model’ 
explains how the different floral organ identity genes interact at the mo- 
lecular level, namely via their gene products, which form quaternary, DNA- 
binding transcription factors that may bind to two ds-regulatory elements 
termed CArG-boxes (consensus: 5’-CC(A/T)6GG-3’). 
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Fig. 1. Phylogenetic tree showing the relationships between AG-like, AGL2-like, DEF-like, 
GLO-like and SQUA-like MADS-domain proteins from Arabidopsis and rice. The tree was 
constructed using conceptual MIK-domain sequences aligned by CLUSTALX and the neigh- 
bor-joining algorithm. OSMADS sequences are from rice (Oryza), all others from Arabidop- 
sis. The numbers next to some nodes give bootstrap percentages, shown only for relevant 
nodes and those defining gene subfamilies. Subfamilies, which generally represent mono- 
phyletic gene clades (Theifien et al. 1996; Munster et al. 1997), are labeled by brackets at the 
right margin 



AGL2-like genes in grasses show relatively heterogeneous expression 
patterns, strongly suggesting that they are also functionally not a homoge- 
neous class of genes. It seems that this subfamily underwent a relatively 
rapid functional diversification within the grasses, which possibly reflects 
the evolution of complex inflorescence structures (Theifien et al. 1996; 
Cacharron et al. 1999). 

Five different AGL2 - like gene have been identified in the rice genome, 
termed OsMADS 1, OsMADS5 , OsMADS24 , OsMADS34 and OsMADS45 
(Chung et al. 1994; Greco et al. 1997; Kang and An 1997; Pelucchi et al. 2002). 
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OsMADS7 and OsMADS8 (Kang et al. 1997) are allelic to OsMADS45 and 
0sMADS24, respectively (Table 1); OsMADSl corresponds to the leafy hull 
sterilel ( Ihsl ) mutant of rice (Jeon et al. 2000). For each of the five different 
genes mentioned, one or two putative orthologs from maize have been 
identified, indicating that the most recent common ancestor of maize and 
rice about 50-70 million years ago had already at least five different AGL2- 
like genes (Munster et al. 2002). 

Only for one AGL2-like gene from rice a mutant analysis has been carried out so far. Ihsl 
rice plants in which OSMADS1 is mutated often develop several florets within the spikelets, 
while wild-type rice spikelets have only one floret (Jeon et al. 2000). This suggests that 
OSMADS1 /LHS1 plays an important role in floral meristem determination during the early 
development of rice florets (Jeon et al. 2000). Other phenotypic characteristics of plants with 
missense mutations in OSMADS1 are elongated papery leafy paleas and lemmas, palea- and 
lemma-like lodicules, fewer stamens, and an increase in carpel number. The changes in 
lodicules and reproductive organs resemble class B loss-of-function mutants in grasses 
(Kang et al. 1998; Ambrose et al. 2000), suggesting that OsMADSl is required for this 
function. In addition to OsMADSl loss-of-function plants, putative gain-of- function plants 
have been studied. When OsMADSl is ectopically expressed, stunted panicles with irregular 
positioned branches and spikelets develop, and within spikelets, the glumes are transformed 
into palea/lemma-like organs (Prasad et al. 2001 ). Thus OsMADSl maybe sufficient to confer 
palea/lemma identity to glumes, while the Ihsl mutant suggests that it is not generally 
required for the specification of these organs. During spikelet development, OsMADSl 
expression is confined to the lemma, palea and carpel, but expression in the carpel is only 
weak (Chung et al. 1994; Prasad et al. 2001). This points again to the importance of OsMADSl 
for the development of palea and lemma. So even if we homologize these organs to the 
eudicot sepals, OsMADSl function would remarkably deviate from the class E gene function, 
which is confined to petal, stamen and carpel development. This is in line with phylogeny 
reconstructions, which indicate that within AGL2-like genes, OsMADS24 and OsMADS45 
are more closely related to the Arabidopsis class E (SEP) genes than OsMADSl , OsMADS5 
and OsMADS34 (Fig. 1; Munster et al. 2002). This again fits to the expression patterns known. 

OsMADS5 and OsMADS34 are transcribed in stamens and (weakly) in carpels, or in all 
plant tissues, respectively (Kang and An 1997; Pelucchi et al. 2002), which also makes them 
no obvious candidates for class E genes in a strict sense. OsMADS24 and OsMADS45 are 
expressed in lodicules, stamens and carpels (Greco et al. 1997). These genes thus show 
expression patterns very similar to that of AGL9 (especially if we homologize lodicules to 
petals); they are thus better class E candidate genes than the other AGL2-like genes from 
rice. 

Unfortunately, mutant phenotypes are not known for any AGL2-like gene 
from rice except OsMADSl ILHS1. Moreover, conservation of class E genes 
is still unclear even in eudicots, and both redundancy (Arabidopsis) and 
subfunctionalization into whorl-specific genes (Gerber a) complicate the 
definition of the class E gene function (Pelaz et al. 2000; Kotilainen et al. 
2000; Theifien 2001b). Obviously, more work is needed to determine to 
which extent the functions of AGL2-like genes in rice resemble the class E 
gene function in Arabidopsis. A general key to the function of the AGL2-like 
proteins could be their ability to interact with members of other subfamilies 
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in heterodimeric and higher order protein complexes (‘floral quartets’) 
(Fan et al. 1997; Honma and Goto 2001; Theifien 2001b; TheiCen and Saedler 
2001 ). 



3 Concluding Remarks 

opment in angiosperms candidates for floral homeotic genes from rice are 
easily found by identifying putative rice orthologs of the floral homeotic 
genes from Arabidopsis (and other eudicots, such as Antirrhinum and 
Petunia). In first approximation these are identified in phylogenetic trees 
as members of the same MADS-box gene subfamilies (TheiSen et al. 1996, 
2000; Munster et al. 2002). This does not imply, however, that the functions 
of the rice genes thus identified are equivalent or even identical to their 
eudicot orthologs. There are several reasons for this: 

1. Gene functions evolve, and even orthologous genes can functionally 
diverge during evolution (Theiflen 2002b). Independent gene duplica- 
tions in the lineages that led to extant eudicots and monocots have 
established orthologous relationships between clades of paralogous 
genes rather than individual genes (Munster et al. 2002), suggesting that 
subfunctionalization may have led to functional divergence between rice 
and Arabidopsis MADS-box genes. 

2. The evolution of genes and morphological structures are not necessarily 
congruent, and homology between the organs of Arabidopsis flowers 
and of rice spikelets is unclear in some cases. For example, the expres- 
sion oiDEF- and GLO-like genes in the lodicules of rice and other grasses 
and in eudicot petals could well reflect homology between lodicules and 
petals, but would also be compatible with an independent origin of 
lodicules from stamens in the lineage that led to extant grasses. The class 
B-like function of DEF- and GLO-like genes in grasses and eudicots may 
thus involve exclusively the specification of homologous organs, i.e. 
stamens, and lodicules or petals (class B-like function in a strict sense), 
or it may include the specification of nonhomologous organs (lodicules 
and petals) (class B-like function in a less strict sense). 

3. One trivial obstacle for a comprehensive comparison of floral homeotic 
gene functions in Arabidopsis and rice is, despite the ‘model’ character 
of both species, the fact that the extent to which candidate genes have 
been studied is still quite different in both species. For example, among 
the 13 AG-, AGL2-, DEF-, GLO- and SQUA-like genes in the Arabidopsis 
genome (candidate or proven class C+D, E, B, and A genes, respectively), 
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a mutant phenotype has been published for all except two genes ( AGL11 , 
AGL3). In contrast, for rice, the same total number of members of the 
mentioned gene subfamilies has been reported so far, but loss-of-func- 
tion phenotypes have been reported for just four genes ( OsMADSl , 
OsMADS3 , OsMADS4 y OsMADS16 ) (see Table 1). Thus while the func- 
tions of most Arabidopsis genes could be determined relatively precisely 
by the phenotypes of single or multiple null mutants, often only expres- 
sion patterns, or gain-of-function phenotypes (mostly even only in 
heterologous plants such as tobacco) give clues to the function of the 
corresponding rice genes (e.g., Kang and An 1997). So quite a lot of work 
lies ahead before a detailed and comprehensive comparison of floral 
homeotic gene functions between rice and Arabidopsis can be done. 

There is hardly any doubt, however, that intensive comparative studies of 
the floral homeotic genes (or their orthologs) from Arabidopsis and rice 
(Table 1) will tell us a great deal about the conservation and divergence of 
flower development throughout the angiosperms, especially if these are 
combined with comparative investigations on organ development and 
structure. One of the most relevant questions to answer is as to how gene 
evolution and morphological evolution are interlinked. The fact that gene 
evolution is often almost ‘clock-like 5 , while morphological evolution can be 
erratic, is one of the most intriguing observations to be explained. Studies 
on the small number of floral homeotic genes, and their huge impact on 
organogenesis, promise a better understanding of this enigmatic topic 
(Theiflen et al. 2000). The 160-200 million years of evolutionary separation, 
the apparent differences between their flowers, and the obvious similarities 
between their (putative or proven) floral homeotic genes make rice and 
Arabidopsis an obvious and attractive choice for studies on the relationship 
between the evolution of genes and morphological structures. A better 
understanding of this relationship could not only help us to better under- 
stand the molecular genetic basis of biodiversity on earth; it might also help 
us to eventually develop homeotic genes into tools for the improvement of 
crop plants by transgenic technology, or mutagenesis and marker assisted 
breeding. 
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Molecular Responses of Halophytes to High Salinity 

Dortje Golldack 



1 Effects of Salinity on Plants 

Excess amounts of salt (NaCl) occur as an abiotic environmental factor in 
certain habitats, e.g. as salt deserts in arid and semiarid areas, in coastal salt 
marshes, and along inland saline lakes. Next to the natural occurrence of 
salinity, salinization of soils has increasingly become a crucial problem for 
agriculture during the last decades. Salinization affects soils in particular 
in arid and semi-arid regions as well as in irrigated agronomic areas. For 
irrigation, availability of high-quality freshwater is often limited and water- 
containing traces of salt is used instead. Besides, salts are dissolved by 
irrigation within soil profiles, leading to increased accumulation in the soil 
surface layers. 

Today, about one third of irrigated agricultural land is salinized world- 
wide (Schachtman and Liu 1999). Soil salinization severely limits agricul- 
tural productivity because almost all crop plants are highly sensitive to 
elevated concentrations of NaCl. Concentrations as low as 25 mM NaCl are 
not tolerated by many plants and concentrations of 1 50 mM NaCl are highly 
toxic for most crop plants (Serrano and Gaxiola 1994; Rubio et al. 1995). A 
recent example for salt desertification caused by mismanagement of soil 
irrigation is the Aral Sea Basin where soil salinity affects millions of people 
depending on agricultural production in this region, resulting in enormous 
economical and social problems (Saiko and Zonn 2000). The ecological and 
economic impact of soil salinization has led to intensive research on the 
physiological and molecular mechanisms of salt adaptation in plants with 
particular focus on the tolerance mechanisms in naturally salt-tolerant 
plants (e.g. Hasegawa et al. 2000; Bohnert et al. 2001, Lauchli and Liittge 
2002). 

Plants having the genetic potential to grow on saline soils are halophytes 
that may tolerate elevated salt concentrations up to 500 mM NaCl. The 
capability to tolerate excess salt concentrations is of polyphyletic origin and 
an estimated amount of 5,000 to 6,000 angiosperm species from diverse 
plant families belong to the group of halophytes (Glenn et al. 1999). In 
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general, halophytes might be subdivided into salt avoiders and salt in- 
cludes. Salt-avoiding species exclude either excess NaCl by so-called root 
filtration as in some mangrove species or excrete NaCl via salt glands or 
vesiculated trichomes, e.g. as known for some halophytes of the Chenopo- 
diaceae. Salt-including halophytes are often succulents, e.g. species of the 
Aizoaceae that hyperaccumulate salts in the leaves. In succulents, increased 
water content in the central vacuoles enables the accumulation of large 
quantities of salt and minimizes NaCl toxicity in the cytoplasmic compart- 
ment. 



2 Main Mechanisms of Salt Adaptation in Halophytes 

Exposure to excess salt results in both hyperosmotic and hyperionic stress 
effects in plants. High salt concentrations in the soil as well as salt accumu- 
lation within the apoplastic compartment cause osmotic stress and water 
deficit, whereas uptake and accumulation of salt in the cytoplasmic com- 
partment causes damage of the metabolism by increased Na + and Cl“ 
concentrations and altered K + /Na + ratios. Although most halophytes use 
Na + (Fig. 1) and Cl" as osmolytes by accumulating these ions in the central 
vacuole, enhanced concentrations of Na + and Cl” in the cytoplasm are 
highly toxic. According to in vitro studies, concentrations of 100 mM Na + 
inhibit protein synthesis by competing with K + for binding sites at proteins 
(Blumwald et al. 2000). 




H. vulgare C. gayana 




o 125 500 o 125 500 (mM NaCl] 
P. maritima C. virescens 



Fig. 1. Accumulation of sodium, calcium, and potassium in the crop plant barley ( Hordeum 
vulgare) and in the halophytes Chloris gayana (Poaceae), Plantago maritima (Plantagi- 
naceae), and Carpobrotus virescens (Aizoaceae). The plants were adapted either to control 
conditions (0 mM NaCl) or treated with 125 and 500 mM NaCl for 48 h 
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A main strategy of halophytes to cope with osmotic stress is the synthesis 
of osmolytes (‘compatible solutes') and their cytoplasmic accumulation. 
Most halophytes accumulate the amino acid proline; besides, polyols are 
common osmolytes. The function of these hydrophilic, low molecular 
weight organic compounds consists in osmotic balancing as well as struc- 
tural stabilization of membranes and proteins. The second main mecha- 
nism for salt adaptation is the maintenance of the intracellular ion homeo- 
stasis with particular importance of regulating the cytoplasmic K + /Na + 
ratio. 



2.1 Yeast Serves as a Model System 

for Plant Salt Tolerance Mechanisms 

Yeast ( Saccharomyces cerevisiae) has been serving as a model system for 
studies on the molecular responses of salt adaptation not only in fungi but 
also in plants due to similarities in the regulation of osmotic and ion 
homeostasis. Accordingly, molecular key mechanisms of yeast salt adapta- 
tion will be reviewed in the following. 

Glycerol-3-phosphate dehydrogenase encoded by the genes GPD1 and 
GPD2 is the key enzyme for the synthesis of the compatible solute glycerol 
and is transcriptionally activated in salt-stressed yeast (Posas et al. 2000). 
Yeast regulates the cytoplasmic K + /Na + ratio by excluding Na + from cells 
and by accumulation in pre-vacuolar compartments and vacuoles. The 
export of Na + from cells is mediated by the plasma membrane-localized 
Na + /H + -antiporter NHA1 and by the Na + -ATPase ENA1, whereas the 
Na + /H + -antiporter NHX1 mediates the transport of Na + into the vacuolar 
compartments (e.g. Nass et al. 1997). In addition to Na + detoxification, 
NHX1 serves other functions in yeast as well. Hyperosmotic concentrations 
of sorbitol inhibited cell division in nhxl mutants, indicating involvement 
of Na + sequestered in the vacuoles in osmotically driven import of water 
via aquaporins (Nass and Rao 1998). Besides, increased Na + and H + con- 
centrations are also the driving force for Cl"-uptake that is mediated by Cl" 
channels of the Gefl type in yeast (Flis et al. 2002). 

Osmotic signaling in yeast is mediated by the HOG (high osmolarity 
glycerol response) pathway including the HOG1 gene that codes for a 
mitogen-activated kinase and causes increased biosynthesis and accumu- 
lation of glycerol (Hohmann 2002). In addition, there is a calcineurin 
involving pathway responding specifically to ionic stress. This pathway 
regulates the homeostasis of K + , Na + , and Ca 2+ in salt-stressed yeast and 
causes transcriptional activation of Na + -ATPases and Ca 2+ -ATPases (e.g. 
Matheos et al. 1997). 
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2.2 Salt-Dependent Regulation of Ion Homeostasis 
in Halotolerant Plants 

Successful salt adaptation of halotolerant plants depends on maintenance 
of cytoplasmic K + /Na + homeostasis with discrimination of K + over Na + . 
Although no specific entry system for Na + is known on the molecular level 
yet, competition with the macronutrient K + for cellular influx is likely 
(Blumwald et al. 2000; Fig. 2). Candidates to mediate Na + influx are, e.g. 
voltage-independent monovalent cation channels (VIC; White 1999) that 
have not been identified on the molecular level yet, and LCTl-type low- af- 
finity cation transporters that have been characterized in wheat (Clemens 
et al. 1998). HKTl-type transporters isolated from wheat, Arabidopsis 
thaliana , rice, and the halophyte Mesembryanthemum crystallinum medi- 
ate Na + influx when heterologously expressed in Xenopus oocytes (Schacht- 
man and Schroeder 1994; Uozumi et al. 2000, Golldack et al. 2002; H. Su, E. 
Balderas, R. Vera-Estrella, D. Golldack, F. Quigley, C. Zhao, O. Pantoja, and 
H J. Bohnert, unpubl. results). In salt-stressed M. crystallinum and in the 
moderately salt- tolerant rice line Oryza sativa var. Pokkali a transcriptional 
decline occurs for HKTl-type K + -transporters (Golldack et al. 2002; H. Su, 
E. Balderas, R. Vera-Estrella, D. Golldack, F. Quigley, C. Zhao, O. Pantoja, 




Fig. 2. Ion transport proteins involved in K + and Na + homeostasis in plant cells. SOS1 
Na + /H + antiporter; HAK K + transporter; HKT K + /Na + symporter or Na + transporter; 
P-ATPase P-type H + -ATPase; V-ATPaseV - type H + -ATPase; PPiase vacuolar H + -pyrophos- 
phatase; IRK inward-rectifying K + channels; ORK outward-rectifying K + channels; VIC 
voltage-independent cation channels; VK vacuolar K + channels; ITR Na + /myo-inositol 
symporter; NHX Na + /H + antiporter; LCT1 low-affinity cation transporter. See text for 
further details 




Molecular Responses of Halophytes to High Salinity 



223 



and H .J. Bohnert, unpubl. results) as well as for AKT/KAT-type inward-rec- 
tifying K + channels (Su et al. 2001; Golldack et al. 2003) that are highly 
selective for K + but do not completely exclude Na + (Amtmann and Sanders 

1999) . In contrast, members of the HAK/KUP-type dual-affinity K + -trans- 
porter family are transcriptionally activated in salt-stressed Af. crystal- 
linum. When expressed in a trkl/trk2 yeast mutant deficient in K + uptake, 
McHAKl and McHAK4 mediated K + but not Na + influx indicating that 
McHAK transporters become the major K + uptake system in salt-exposed 
M. crystallinum by efficiently excluding Na + (Su et al. 2002). 

Whereas there is no evidence for Na + -extruding Na + -ATPases homolo- 
gous to yeast ENA1 in plants yet and the gene AtSOSl encoding a plasma 
membrane Na + /H + antiporter homologous to bacterial and yeast Na + /H + 
antiporters has been identified only in the glycophyte A. thaliana (Shi et al. 

2000) , but not in halophytes as yet, the vacuolar Na + sequestration is similar 
in yeast and in plants. Vacuolar uptake of Na + is mediated by a secondary 
active Na + /H + antiport that is energized by the vacuolar H + -ATPase (V-AT- 
Pase). The activity of this enzyme as well as its transcription and translation 
are stimulated by salt stress in halophytes, such as M. crystallinum and 
Suaeda salsa (Ratajczak et al. 1994; Golldack und Dietz 2001; Wang et al. 

2001) . Salt-induced expression of V-ATPase subunits has also been shown 
for salt-adapted suspension cells of tobacco and for salt-tolerant Beta 
vulgaris (Narasimhan et al. 1991; Kirsch et al. 1996; Lehr et al. 1999). In 
contrast, no salt-induced transcription of V-ATPase subunits has been 
observed for glycophytes, such as A. thaliana (Kluge et al. 1999) indicating 
salt-dependent activation of V-ATPase to be a specific response mechanism 
in halophytes that is missing in glycophytes. The vacuolar H + -pyrophos- 
phatase that is the second electrogenic H + -pump at the tonoplast shows 
increased protein amounts in S. salsa (Wang et al. 2001) and increased 
transcript levels in the halotolerant Lobularia maritima (O.V. Popova and 
D. Golldack, unpubl. results) in response to salt stress. 

NHX-type transporters have a key function in maintenance of cytoplas- 
mic Na + homeostasis by mediating a secondary active Na + /H + antiport at 
the tonoplast (Apse et al. 1999). Salt-induced Na + /H + antiport has been 
shown at isolated tonoplast membrane vesicles of the halophytes M. crys- 
tallinum and Salicornia bigelovii (Barkla et al. 1995; Parks et al. 2002). In 
the salt-tolerant Plantago maritima a vacuolar Na + /H + antiport has been 
identified that is missing in the closely related salt-sensitive species P. 
media (Staal et al. 1991). On the molecular level, the first NHX-type Na + /H + 
antiporter from plants was identified and cloned with AtNHX 1 from A. 
thaliana based on sequence information from the Arabidopsis genome 
sequencing initiative (Apse et al. 1999; Gaxiola et al. 1999; Quintero et al. 
2000), whereas the first NHX-type transporter cloned from a halophyte was 
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AgNHXl from Atriplex gmelini (Hamada et al. 2001). Exposure of A. gmelini 
to 100 and 400 mM NaCl, respectively, caused a three- to eightfold increase 
in the transcript amount of AgNHXl (Hamada et al. 2001). Reports on the 
transcriptional regulation of NHX-type transporters in glycophytes in re- 
sponse to salt are somewhat controversial. Whereas Gaxiola et al. (1999) in 
A, thaliana and Fukuda et al. (1999) in rice showed increased expression of 
this transporter type, Apse et al. (1999) and Chauhan et al. (2000) found no 
increase in transcript and protein amounts as well as Na + /H + -antiport 
activity of AtNHXl and AtNHX2 , respectively, in salt-stressed Arabidopsis . 
Transgenic overexpression of AtNHXl caused increased vacuolar Na + /H + - 
transport activity and salt tolerance compared to wild-type Arabidopsis . 
Whereas a concentration of 200 mM NaCl was lethal to the wild type, this 
concentration was tolerated by the transgenic Atnhxl plants (Apse et al. 
1999). Besides, also in tomato and Brassica napus overexpression of 
AtNHXl resulted in increased salt tolerance that was accompanied by 
increased Na + accumulation. The transgenic B . napus plants accumulated 
Na + up to 6% of its dry weight, whereas the concentration of K + declined 
to about 20% compared to non-stressed plants (Zhang und Blumwald 2001; 
Zhang et al. 2001). 

Interestingly, AtNHXl- overexpressing A. thaliana displayed increased 
salt tolerance accompanied by stimulated Na + /H + -antiport activity but no 
significant increase in the AtNHXl transcripts and protein compared to 
control plants (Apse et al. 1999), indicating that overexpression oiAtNHXl 
might activate additional mechanisms involved in salt tolerance, as e.g. 
salt-specific signaling, regulation of water uptake or homeostasis of other 
ions such as Cl". Mutation of IpNHXl in Ipomoea nil inhibited the develop- 
ment of blue color in the flowers that depends on the vacuolar pH, indicat- 
ing additional function of NHX-type transporters for regulation of vacuolar 
pH and probably also involvement in the homeostasis of other cations in 
plant cells (Fukada-Tanaka et al. 2000). 



2.3 Synthesis and Accumulation of Osmoprotective Compounds 

Salt-tolerant plants accumulate the amino acid proline, quaternary ammo- 
nium compounds (e.g. glycine betaine), polyols, and sugars, respectively, 
as osmoprotective compounds that are synthesized via pathways that are 
connected to the basic plant metabolism. 

Salt-induced synthesis and accumulation of proline has been reported 
from numerous halophytes (e.g. Maggio et al. 2000). In response to salinity, 
proline is synthesized de novo from glutamate (Nanjo et al. 1999). Accord- 
ingly, the requirement for nitrogen increases in salt-stressed plants. In M. 
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crystallinum , expression of the nitrate transporter homologue McNRTl 
increased in root epidermal cells and in the vasculature indicating stimu- 
lated uptake and in planta transport of nitrogen under salinity (O.V. 
Popova and D. Golldack, unpubl. results). Key enzymes of the biosynthesis 
of proline are A^pyrroline-S-carboxylate synthase (P5CS) and A^pyr- 
roline-5-carboxylate reductase (P5CR) that catalyze the conversion of glu- 
tamate to proline. The NADP-specific isocitrate dehydrogenase (ICDH) has 
a key function in the biosynthesis of amino acids by generation of 2-oxo- 
glutarate that is the primary acceptor for NH 3 assimilation. In leaves of M. 
crystallinum, enzyme activity as well as transcript and protein amounts of 
the NADP-specific ICDH McICDHl increased in response to salt stress. 
However, enzyme activity and transcript levels of ferredoxin-dependent 
glutamate synthase which catalyzes the synthesis of the proline precursor 
glutamate increased during short exposure of M. crystallinum to salinity 
but decreased in salt-adapted plants (Popova et al. 2002). These data indi- 
cate that de novo biosynthesis of proline is stimulated during the initial 
phase of salt stress but accumulation in salt-adapted plants is caused by 
inhibited degradation of the amino acid. 

The second widely distributed compatible solute, the quaternary ammo- 
nium compound glycine betaine, is synthesized in chloroplasts in a two- 
step process by oxidation of choline to betaine aldehyde that is mediated 
by the salt-inducible choline monooxygenase (CMO) followed by a second 
oxidation step catalyzed by NAD-dependent betaine aldehyde dehydro- 
genase (Russel et al. 1998; McNeil et al. 1999). From halophytes of the 
Plumbaginaceae, four additional quaternary ammonium compounds are 
known that replace or supplement glycine betaine. (5-Alanine betaine is 
synthesized by methylation of (5-alanine. (5- Alanine betaine is likely to be a 
favored compatible solute under the hypoxic conditions of salt marshes, 
because there is no direct requirement of O 2 for the biosynthesis of this 
compound (Hanson et al. 1994). In contrast, proline betaine and hy- 
droxyproline betaine have been particularly found in Plumbaginaceae from 
dry or saline conditions (Hanson et al. 1994). Choline O-sulfate is likely to 
have not only a role in osmoprotection but also in detoxification of sulfate 
that may be found in excess amounts in saline environments as well. 
Choline is converted to choline O-sulfate by a 3’-phosphoadenosine-5’- 
phosphosulfate-dependent choline sulfotransferase. Increased activity of 
this enzyme is induced by NaCl in leaves and roots of e.g. Limonium species 
(McNeil et al. 1999). 

Dimethylsulfonium propionate (DMSP) is a tertiary sulfonium com- 
pound that is an analogue of glycine betaine and has been found in halo- 
phytes of the Compositae, as e.g. the coastal plant Wollastonia biflora and 
in Poaceae, e.g., as the salt marsh grass Spartina alternifolia (Kocsis et al. 
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1998). The precursor of DMSP, methionine, is converted to S-methyl-L- 
methionine (SMM) in the cytosol. SMM is imported then into chloroplasts, 
deaminated and decarboxylated and converted to DMSP by a dehydro- 
genase (Trossat et al. 1998). A cDNA encoding the enzyme S-adenosyl 
MetiMet S-methyltransferase that catalyzes the SMM synthesis has been 
cloned from W. biflora (Bourgis et al. 1999). 

Increased synthesis and cytosolic accumulation of compatible solutes of 
the group of polyols is known from salt-stressed M. crystallinum . Myo- 
inositol is synthesized from glucose-6-phosphate via myo - inositol 1 -phos- 
phate catalyzed by the enzymes myo - inositol 1 -phosphate synthase (INPS) 
and myo - inositol 1 -phosphate phosphatase. Myo - inositol is methylated to 
D-ononitolby the myo - inositol O-methyltransferase (IMT) and D-ononitol 
is epimerized to D-pinitol. In salt stressed M. crystallinum , the protein of 
IMT increased strongly in leaves and slightly in roots, whereas INPS was 
only upregulated in leaves and inhibited in roots but inositol and ononitol 
were transported to the roots via the phloem (Nelson et al. 1998, 1999). 
Tonoplast located Na + /rayo-inositol symporters (ITR) have been identified 
in M. crystallinum and have been suggested to function in the transport of 
Na + from root cell vacuoles to leaves (Chauhan et al. 2000; Fig. 2). 



2.4 Protection Against ROS (Reactive Oxygen Species) 

Next to osmotic and ionic stress under high salinity, plants are also affected 
by secondary stresses, e.g. the salt-induced excessive generation of reactive 
oxygen species (ROS), such as hydroxyl radicals, superoxide anions, and 
hydrogen peroxide by Fenton type reactions. Excess ROS formation in 
salt-stressed plants is caused by water deficits due to the osmotic compo- 
nent of salinity that leads to reduced CO 2 fixation and reduced regeneration 
of NADP + in the Calvin cycle (e.g. Vranova et al. 2002). ROS may damage 
e.g. cell walls, membranes, proteins, and DNA by non-specific oxidation 
(e.g. Schiitzendiibel and Polle 2002). 

In addition to osmotic balancing, another important function of os- 
molytes is the oxidative detoxification by scavenging of ROS. In transgenic 
tobacco, expression of a bacterial mannitol- 1 -phosphate dehydrogenase in 
chloroplasts caused accumulation of the polyol mannitol and increased 
tolerance to oxidative stress (Shen et al. 1997). Other antioxidant metabo- 
lites used by plants for ROS detoxification are, e.g., ascorbate, being a 
substrate for ascorbate peroxidase, thioredoxin, glutathione, and a-toco- 
pherol (Asada 1999). Enzymatic detoxification of ROS is performed, e.g. by 
superoxide dismutase (SOD) that converts superoxide to H 2 O 2 , catalase 
that catalyzes the detoxification of H 2 O 2 to water and oxygen, and peroxi- 
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dases (Asada 1999). In calli of Helianthus annuus that are moderately 
tolerant to elevated concentrations of KC1, activities of SOD, catalase, and 
peroxidase increased (Santos et al. 2001). Overexpression of glutathione 
S-transferase and glutathione peroxidase resulted in increased growth of 
transgenic tobacco exposed to salt stress (Roxas et al. 1997). In M. crystal- 
linurriy salt stress causes increased enzyme activity of NADP-dependent 
isocitrate dehydrogenase, and up-regulated expression of McICDHl was 
particularly observed in leaf mesophyll cells (Popova et al. 2002). A role of 
NADP-dependent isocitrate dehydrogenase in supplying NADPH to the 
glutathione reductase/gluthatione peroxidase system for detoxification of 
H 2 O 2 has been suggested (Popova et al. 2002). 



2.5 Signal Transduction and Salt Adaptation 

Regulatory signaling elements of salt-adaptation pathways have been rarely 
identified in halophytes yet but most knowledge on salinity-induced signal 
transduction is derived from the salt-sensitive model plant A . thaliana. 
Transcription factors of the MYC, MYB, and bZIP type interact with pro- 
moters of genes that are responsive to osmotic stress in A. thaliana (Shi- 
nozaki and Yamaguchi-Shinozaki 2000). Overexpression of a cDNA encod- 
ing the transcription factor DREB 1A that interacts with the Arabidopsis 
cis- acting ‘dehydration response element 5 DRE caused increased salt and 
drought tolerance in A. thaliana (Kasuga et al. 1999). In Arabidopsis , a 
signaling pathway responsive to salt stress includes the Ca 2+ -binding pro- 
tein SOS3 that activates the serine/threonine protein kinase SOS2. Salt-in- 
duced expression of the plasma membrane Na + /H + antiporter SOS1 that 
has been described from A. thaliana but not from halophytes yet, may be 
regulated by SOS3 and SOS2 (Halfter et al. 2000). A salt-induced mitogen- 
activated protein kinase (M APK) has been identified from alfalfa with SIMK 
that is activated by the MAPK kinase SIMKK (Kiegerl et al. 2000). 

In the salt-tolerant grass species Lophopyrum elongatum that is native 
to salt marshes of the Mediterranean region, expression of the ser- 
ine/threonine kinase Esi47 is induced by salt stress of 250 mM NaCl and by 
abscisic acid (Shen et al. 2001). In M. crystallinum , transcription of the 
salt-inducible phosphoenolpyruvate carboxylase isoform Ppcl was stimu- 
lated by the Ca 2+ -ionophore ionomycin and the Ca 2+ -ATPase inhibitor 
thapsigargin indicating that Ca 2+ has a role in signaling events that lead to 
salt-dependent Ppcl -induction (Taybi and Cushman 1999). Salt-inducible 
expression of the V-ATPase subunit E, of Ppcl y and of the key enzyme of 
polyol synthesis, myo-inositol O-methyltransferase (Imtl), could be inhib- 
ited by treatment with the activator of trimeric G-proteins mastoparan 
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suggesting involvement of G-proteins in salt-induced signaling (Golldack 
and Dietz 2001). 



3 Functional Genomics of Salt Adaptation 
3.1 How Many Genes are Salt-Responsive in Plants? 

Although a number of genes that are salt responsive in halophytes have 
been identified, the process of salt adaptation in its complexity is not 
understood yet. To understand the network of processes occurring under 
salt stress and adaptation to high salinity in salt-tolerant plants expression 
studies on the genomic scale are highly necessary. 

The only halophyte that has been analyzed in depth for its molecular 
mechanisms involved in salt adaptation is the salt-tolerant succulent Me- 
sembryanthemum crystallinum that has been becoming a model plant for 
salt stress research during the last decade. Based on EST-sequencings ofM. 
crystallinum it is estimated that 10% of the total gene number of M. 
crystallinum is responsive to salt stress (Cushman et al. 1999). Different 
genome sizes in plants are mainly due to different amounts of interspersed 
repetitive sequences (Bennetzen et al. 1998). Accordingly, it is estimated 
that higher plants may have similar amounts of functional genes (Som- 
merville and Sommerville 1999). Based on a number of approximately 
25,000 functional genes in the model plant A . thaliana (Kaiser 2000) the 
number of salt-responsive genes in M. crystallinum may be estimated to be 
up to 2,500. 



3.2 Global Analyses of the Network of Salt-Responsive Genes 
in Halophytes 

Taking into consideration that more than thousand genes maybe involved 
in salt stress adaptation in halophytes it is obvious that the molecular 
network of salinity stress responses is not understood yet in the model 
halophyte M. crystallinum and nearly nothing is known from other salt-tol- 
erant plant species. An innovative novel approach for systematic identifi- 
cation of salt-responsive genes in halotolerant plants on a genomic scale is 
the use of cDNA arrays that allow fast and comparable analyses of large 
transcript amounts in parallel. Using cDNA arrays it is possible to analyze 
expression profiles of unknown gene populations according to their func- 
tion in salt adaptation and then select and identify genes that were shown 
to be regulated by stress exposure. 
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However, most halophyte species are not amenable for studies on the 
transcriptome level yet due to limited information on their gene sequences 
and accordingly the difficulty to identify salt-regulated genes and their 
products. In contrast, the model plants with complete sequencing of their 
genomes - Arabidopsis and rice - are typical glycophytes that are not very 
salt-tolerant. For comparative genomics studies, halophyte species would 
be most suitable that are closely related to the completely sequenced plant 
models to take advantage of the wealth of genomic data from these species. 

One ideal halophyte model that meets these criteria is Lobularia mari- 
tima - a species of the Brassicaceae that is native to the Canary Islands and 
the Azores. Lobularia does not possess morphological specializations such 
as salt-accumulating bladder cells or salt-excreting glands but the salt 
tolerance of the species is due to adjustment in ion and osmotic homeosta- 
sis. Transcripts from Lobularia share in average 90% identity with homolo- 
gous genes from Arabidopsis (D. Golldack, unpubl. results). According to 
the high level of molecular identity, cDNA arrays synthesized for Lobularia 
may be cross-hybridized with cDNA populations from A. thaliana and 
other Brassicaceae species and expression profiles can be directly compared 
in the closely related salt-tolerant and salt-sensitive species in response to 
salt stress. 

In our studies, we used an array including 480 transcripts generated from 
a subtracted cDNA library of Lobularia that was enriched for salt-induced 
genes to study time and dose dependent responses of the species to salinity. 
Approximately 40% of the elements included in the array showed altered 
transcript abundance under salinity. An early, transient modification of 
gene expression could be distinguished from a second set of ESTs with 
modified transcript levels during long-term acclimation. The identified 
products with altered expression could be grouped into seven functional 
classes: (1) a large group of proteins of yet unknown function, (2) elements 
that control gene expression, (3) signal transduction elements, (4) metabo- 
lic enzymes, e.g. glutamine synthetase, (5) transport proteins, (6) photo- 
synthetic proteins, and (7) the largest group with proteins involved in 
general stress response and defense (Fig. 3; O.V. Popova, S. Pabion, and D. 
Golldack, unpubl. results). 



4 Perpectives 

A complex network of salt-responsive genes involved in salt adaptation in 
halotolerant plants has been identified and novel salt stress tolerance genes 
are likely to be identified from large-scale transcriptome analyses in the 
near future. As a next step, the functions of these genes need to be investi- 
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□ Un-Known Proteins 

■ Control of Gene Expression 

□ Signal Transduction 

■ Metabolism 

□ Transport 

□ General Stress Response 

□ Photosynthesis 



Fig. 3. Functional categories of salt-responsive genes identified from a cDNA array of the 
salt-tolerant Lobularia maritima (Brassicaceae; O.V. Popova, S. Pabion, and D. Golldack, 
unpubl.) 



gated, e.g. by examining knock-out mutants or using specific antisense 
approaches as posttranscriptionally silenced genes by RNAi (Chuang and 
Meyerowitz 2000; Escobar 2001). Detailed functional studies of transcrip- 
tion factors, promoter elements, and signal transduction pathways that 
regulate stress responses will further enhance our knowledge of salt adap- 
tation in naturally halotolerant plants. 
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High Frequency or Ultradian Rhythms in Plants 

Ulrich Liittge and M.-Th. Hiitt 



1 The Quest for the Biological Clock or Circadian Oscillator and: 
w Can We Learn from Ultradian Rhythms?” 

Biorhythm research is currently dominated by the quest for the biological 
clock or circadian oscillator on a genetic basis. One of us has reviewed the 
plant biological literature in a contribution to last year’s volume of this 
series (Liittge 2002). However, while spontaneous ultradian frequency 
rhythms remain a central topic in neurobiology with the electrical firing of 
neurons, and oscillations of cyclic adenosine-monophosphate (cAMP) se- 
cretion, attraction and pattern formation of amoebae of the slime mold 
Dictyostelium (e.g. Berridge and Rapp 1979) and the oscillatory behaviour 
of glycolysis (e.g., Berridge and Rapp 1979; Olsen and Degn 1985; Rapp 
1986; see also below Sect. 2.2) are much studied, astonishingly little research 
effort is currently devoted to ultradian high frequency oscillations in plant 
biology. This is regrettable because much can be learnt from the study of 
high frequency rhythms. This is so not only because it is often much easier 
to keep systems studied under strict experimental control and obtain more 
readily sufficiently extended time series of measurements for theoretical 
analysis than with the much longer circadian oscillations. Although we 
must remember that in Drosophila both the circadian rhythm of eclosion 
and locomotor activity and the very short ultradian rhythm of the male 
courtship song seem to be governed by the same PER gene (Dunlap 1993), 
studying high frequency oscillations mostly implies that we set ourselves 
free from being paralyzed by the evolutionary and genetic biological clock 
enigma. 

Among the ultradian cellular oscillators, Berridge and Rapp (1979) 
distinguished two types, (1) cytoplasmic oscillators, where the periodic 
phenomena are generated by an instability in metabolic pathways, and (2) 
membrane oscillators generating rhythms of membrane potential. There is 
a large variety of ultradian rhythms and there is no evidence for a common 
oscillatory mechanism at the cellular level (Berridge and Rapp 1979). 
Therefore, in contrast to circadian rhythms, we may be more open for 
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system analysis in order to unravel governing principles of nonlinear 
dynamics. This is generic and it may be one of the reasons of the great 
success of studies of, e.g., Dictyostelium and glycolysis. It has much bearing 
also on our understanding of circadian rhythmicity (see Liittge 2002). 

Thus, it appears opportune to review here the respective information 
available in plant biology. This will be done in the following section describ- 
ing phenomena of ultradian rhythms in membranes, metabolism and iso- 
lated enzymes, cell development, stomata and photosynthesis, movements 
of leaves and circumnutation, and root elongation. A theoretical section will 
then pick up key elements of nonlinear dynamics that became obvious 
while looking at the phenomena. This theoretical assessment will aim at 
giving an answer to the question “can we learn from ultradian rhythms?” 
by fathoming “what can we learn from ultradian rhythms?” Finally, we ask 
if ultradian rhythms may have functions and contribute to fitness achieved 
during evolution. 



2 Phenomena 
2.1 Membranes 

The most conspicuous example on everybody’s mind of high frequency 
ultradian rhythmicity involving membranes is the action potential firing of 
neurons. However, spontaneous rhythmic changes of electrical membrane 
potentials, i.e., series of spontaneous action potentials, membrane oscilla- 
tors sensu Berridge and Rapp (1979), with period lengths in the range of 
seconds to minutes and several tens of minutes are also observed in fungi 
such as Neurospora crassa (Gradmann and Slayman 1975; Slayman et al. 
1976), green algae such as Nitella, Chara and Acetabularia (Gradmann 
1976; Ogata and Kishimoto 1976; Vucinic et al. 1978; Martens et al. 1979; 
Hayashi et al. 1982; Fisahn et al. 1986, 1989; Lucas and Fisahn 1989) and 
higher plants, e.g., Rorippa nasturtium aquaticum (Ping and Lou 1990). The 
literature up to the late 1980s is well reviewed in the introduction of Fisahn 
et al. (1989). Proton pumping at the plasma membrane evidently is involved 
(Gradmann and Slayman 1975; Fisahn et al. 1989; Tyerman et al. 2001). 
Sinusoidal electrical stimulation of giant internodal cells of Nitella with 
varied frequencies showed entrainment, quasiperiodic behaviour and de- 
terministic chaos (Hayashi et al. 1982, 1983; Olsen and Degn 1985). Ultra- 
dian oscillations of H + , K + , Ca 2+ and Cl" fluxes with different period lengths 
of a few minutes and 50-80 min, respectively, at different regions of the root 
surface have been described by Shabala and Knowles (2002). They may be 
important in nutrient acquisition via the mature root zone. These observa- 
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tions also raise questions of synchronization/desynchronization between 
these different ion fluxes and also between different regions of the root 
surface. 

Thus, the most conspicuous phenomena of nonlinear dynamics are 
expressed. Molecular entities involved are up to several different “electro- 
enzymes” such as channels, pumps and carriers (Gradmann 2001), e.g., in 
the plasma membrane of isolated root protoplasts, the H + - ATPase, inward 
and outward K + rectifiers and gated proton channels (Tyerman et al. 2001). 
Ultradian oscillations of net flux of K + and external pH occur in osmo- and 
turgor-regulation after the onset of hyperosmotic shock in Arabidopsis 
epidermal root cells and appear to be related to voltage gated K + trans- 
porters in feedback loops of cell osmotic adjustment (Shabala and Lew 
2002). To date the question appears to remain open if these oscillations of 
membrane system require cellular functions as suggested by the work with 
isolated protoplasts, where feedback control of cellular ion relations (Grad- 
mann and Hoffstadt 1998) and metabolism (see below Sect. 2.2) may be 
involved (Tyerman et al. 2001), or if they may also be generated by isolated 
membranes. In fact, the electro-enzymes may be electrically coupled and 
interact with each other via voltage changes (Gradmann et al. 1993; Grad- 
mann and Buschmann 1997; Gradmann 2001). The isolated wheat root 
protoplasts of Tyerman et al. (2001) exhibited oscillations in proton flux, 
and either free-running membrane potential oscillations under current 
clamp or free-running current oscillations under voltage clamp. Gradmann 
(2001) presents mathematical models, using two- and five-membrane 
transporters, respectively, in which oscillations are produced not only by 
local changes of ion concentrations, as for the apoplast of cells in parenchy- 
mas, but also by mere electrical coupling of the voltage-dependent electro- 
enzymes in the membrane. 

Calcium ions are known to be important secondary messengers. Cyto- 
plasmic Ca 2+ concentrations control a vast variety of physiological proc- 
esses (Trewavas 1999; Rudd and Franklin-Tong 2001). This includes regu- 
lation of channels, ion pumps and carriers by gating and Ca 2+ -dependent 
phosphorylation. Ca 2+ pumps (Liang and Sze 1998; Sze et al.2000) and Ca 2+ 
channels (White 2000) participate in this regulation network. Calcium 
waves are involved in this signalling (Trewavas 1999). Cytoplasmic Ca 2+ 
levels themselves show ultradian oscillations (Felle 1988; McAinsh et al. 
1990, 1995; Gilroy etal. 1 991; Read etal. 1993; Volotovskietal. 1993;Swillens 
et al. 1994; Bauer et al. 1998) and it has been suggested that the membrane 
oscillator maybe a plasma membrane Ca 2+ channel (McAinsh et al. 1995). 
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2.2 Metabolism and Enzymes 

Ultradian oscillations in metabolic functions belong to the cytoplasmic 
oscillations sensu lato distinguished by Berridge and Rapp (1979; see also 
Rapp 1986). Various functions of isolated mitochondria have been shown 
to oscillate (Berridge and Rapp 1979; for photosynthesis see Sect. 2.5). 
Ultradian oscillations of glycolysis as a linear reaction chain of several 
enzymes with the now well-known allosteric control sites and feedback 
loops covered in all textbooks of biochemistry were discovered in the 1960s 
(Ghosh and Chance 1964). They have been extensively studied ever since 
(Berridge and Rapp 1979; Olsen and Degn 1985; Teusink et al. 1996a, b, 1998; 
Teusink 1999; Bier et al. 2000). It was shown that glycolysis can be driven 
into (deterministic) chaos under periodic supply of glucose (Markus et al. 
1984). Ultradian oscillations as well as chaos occur in glycolysis of cell-free 
extracts of Saccharomyces cerevisiae (Chance et al. 1964; Markus et al. 1984; 
Olsen and Degn 1985). This demonstrates that oscillatory behaviour does 
occur in the absence of immediate genetic control. Theoretical models of 
glycolysis simulating glycolytic oscillations have been produced (Markus 
and Hess 1984; Teusink et al. 1996a), and it can be shown that oscillations 
are not dictated by a single key enzyme or “oscillophore” (Teusink et al. 
1996a; Teusink 1999). The primary signalling molecule is glucose (Teusink 
et al. 1998). 

The most simple conceivable oscillators consist of reactions which com- 
prise only a single enzyme and its substrates (Hauser et al. 2002). Indeed, 
for the argument that oscillations can occur as a system-property totally 
independent of genetic oscillators it is even more relevant that oscillatory 
behaviour is produced by single enzymes. Rhythmic variations of ferre- 
doxin-glutamate synthase activity in sunflower leaves with a period length 
of 4-5 h have been described as a phenomenon (Fernandez-Conde et al. 
1995). Ultradian oscillations of the catalytic capacity of malate dehydro- 
genase in extracts of leaves of the facultative CAM-plant Kalanchoe bloss- 
feldiana cv. Tom Thumb have been explained by conformational changes 
due to changing water aggregates and entropy variations in the protein/sol- 
vent system, and a conceptual analogy to the totally inorganic Belusov- 
Zhabotinsky chemical oscillator was stated (Queiroz-Claret et al. 1988). 

Detailed studies have been performed on the peroxidase-oxidase (PO) 
enzyme of roots of horse radish catalysing the reaction 

0 2 + 2NADH + 2H + 2H 2 0 + 2NAD+ 

which is involved in lignin biosynthesis. In a homogenous system with a 
stirred buffer medium, the enzyme shows nonlinear phenomena, such as 
bistability, oscillations and chaos due to the complicated branched chain 
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nature of the reaction mechanism of the enzyme with its five distinct active 
forms (Olsen 1983; Olsen and Degn 1985; Hauser and Olsen 1996, 1999; 
Hauser et al. 1997; Bronnikova et al. 1998; Moller et al. 1998; Kirkor et al. 
2000). The PO from other sources, e.g., soybean, a fungus and mammals, 
shows similar behaviour (see Kummer et al. 1997; Hauser and Olsen 1998, 
1999; Hauser et al. 2000). The enzyme requires catalytic amounts of pheno- 
lics as electron mediators of electron flow in the reaction. Naturally occur- 
ring phenolics are effective in the reaction, and hence, it can be assumed 
that oscillations also occur in vivo (Kummer et al. 1997; Hauser and Olsen 
1998). 

The routes into chaos comprise a period-doubling-route (Feigenbaum 
route) and a period-adding-route (mixed-mode oscillations) depending on 
the pH of the medium (Hauser and Olsen 1996, 1999). Thus, the PO reaction 
may reproduce all the richness of the non-linear behaviour of the inorganic 
Belusov-Zhabotinsky oscillator (Larter et al. 1993; Hauser and Olsen 1999). 



2.3 Cell Development and Pollen Tubes 

In the marine microheterotroph Traustochytrium sp., closely related to 
heterokont algae, non-damping endogenous ultradian oscillations become 
evident during development. As cell development progresses from the 
zoospore stage towards maturity, oscillations in the period range of several 
minutes of fluxes of H + , Na + and Ca 2+ were expressed. They may be causally 
linked to cell development. H + -fluctuations could be associated with 
changes in cytoskeleton organization required for cell cleavage and zoo- 
spore release (Shabala et al. 2001). 

Another interesting example is rhythmic tip growth as it occurs in fungal 
hyphae (Kaminskyj et al. 1992; Lopez-Franco et al. 1994; Bracker et al. 1995) 
and pollen tubes (Tang et al. 1992; Pierson et al. 1995; 1996; Geitmann et al. 
1996; Geitmann and Cresti 1998; review Feijo et al. 2001). In the pulsating 
tip growth of pollen tubes two types of oscillations can be distinguished, 
i.e., sinusoidal oscillations with periods in the range of seconds and spike- 
shaped oscillations with periods of several minutes (Geitmann and Cresti 
1998). Ca 2+ influxes (Pierson et al. 1995, 1996) and a Ca 2+ -channel oscillator 
may be involved in this pulsation growth. Ca 2+ -channel blockers such as 
La 3+ and Gd 3+ reversibly suppress rhythmicity of growth (Geitmann and 
Cresti 1998). It is intriguing, however, that pollen tube growth and the 
intracellular calcium gradient oscillate in phase while extracellular calcium 
influx is delayed (Holdaway-Clark et al. 1997) and that periodic increases 
in elongation rate precede increases in cytosolic Ca 2+ (Messerli et al. 2000). 
The cytoskeleton may play a role in these oscillations because there is a 
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concerted action of Ca 2+ , vesicle exocytosis and cytoskeleton, and cell wall 
in pollen tube growth (Derksen 1996). Fluctuations in the liberation of 
secretory vesicles (Lopez-Franco et al. 1994; Bracker et al. 1995) possibly 
controlled by the cytoskeleton accompanied by changes in turgor pressure 
(Kaminskyj et al. 1992) have been discussed as causes of the oscillations. 
This also directed attention to mechanical stress or stretch-sensitive Ca 2+ 
channels and gating by elastic tension in the membrane (Ramahaleo et al. 
1996) or by the submembrane cytoskeleton (Sokabe et al. 1991) as possible 
mechanisms mediating the oscillations (Geitmann and Cresti 1998). A 
recent discovery of a new oscillatory component, namely chloride efflux at 
the pollen tube apex, also sheds new light on Ca 2+ interactions. Oscillations 
of Cl - efflux correlate with cell volume, turgor status and extension growth. 
They are targeted by the signalling molecule inositol 3,4,5,6-tetra/czsphos- 
phate known to regulate Ca 2+ -activated Cl“ channels in mammalian cells 
and may interfere with vesicle movements during cell elongation (Zonia et 
al. 2002). 



2.4 Stomata and Photosynthesis 

Ever since the discovery of u Stomata als Glieder eines schwingungsfahigen 
C02-Regelsystems ” (Raschke 1965), stomata have been a key example of 
cybernetics in plants. Ultradian oscillations of stomatal aperture and con- 
ductivity for water vapour have been reported over decades (Barrs 1971; 
Teoh and Palmer 1971; Cowan 1972a,b; Kaiser and Kappen 2001; Roelfsema 
and Hedrich 2002). Primary external signals of control parameters are 
hydraulic effects and leaf internal airspace CO 2 (Willmer 1988) affecting a 
highly complex internal regulation network. Cytoplasmic Ca 2+ levels play 
a key role as a central node in this network (McAinsh et al. 1997), and a 
plasma membrane Ca 2+ channel may be an important oscillator in the 
system (McAinsh et al. 1995). 

However, just as for circadian rhythmicity of stomata and photosynthe- 
sis (see last year’s review in this series: Liittge 2002), due to the intimate 
interaction of photosynthetic metabolism and gas exchange it is difficult 
and often impossible to separate oscillations of stomata and photosynthe- 
sis. As Giersch (1994) reminds us, discovery of ultradian photosynthetic 
oscillations dates back to more than half a century ago (van der Veen 1949). 
They have been reviewed comprehensively (Walker 1992, Giersch 1994). 
Giersch presents an intriguing diagram (Giersch 1994: see Fig. 2), which 
poses the question “where is the oscillator”? just as in the enigma of and the 
quest for the central clock in circadian rhythmicity (see Liittge 2002). 
Oscillations of about a period of 1 min are well sustained in an intact leaf, 
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highly damped in isolated protoplasts and reduced to some irregularities 
in isolated chloroplasts. Notwithstanding the possibility of artefacts due to 
isolation, this strongly suggests that oscillations are consequences of the 
integration of functional subsystems. This encompasses oscillating meta- 
bolites, such as 3-phosphoglyceric acid, ribulosebisphosphate, 
NADPH/NADP, ATP/ADP and inorganic phosphate (Giersch 1994) and the 
regulatory metabolite fructose 2,6-bisphosphate (Stitt et al. 1988). The 
detailed metabolic mechanism of the oscillations to date is not clear 
(Giersch 1994). One explanation was temporary imbalance in the supply of 
NADPH and ATP to the photosynthetic reductive carbon cycle (Laisk et al. 
1991), hence, imbalance between photochemical electron transport and 
C02-fixation capacity (Stitt and Schreiber 1988; Stitt et al. 1988). In this 
respect, however, Raghavendra et al. (1995) found interesting differences 
between the C 3 plant Spinacia oleracea and the C 4 plant Amaranthus 
caudatus . In C 4 plants oscillations are rarely observed under conditions 
readily producing oscillations in C 3 plants, i.e., dark/light-transitions at 
saturating CO 2 , while oscillations occur after a sudden dark/light-transition 
at a lower light intensity, which would not allow oscillations in C 3 plants. It 
is suggested by these authors that in Amaranthus water oxidation does not 
occur as synchronously with CO 2 uptake as in Spinacia and that, by contrast 
to the C 3 plant, in the C 4 -plant photosynthetic oscillations are due to 
co-operation of different cell types with the diffusional transport of photo- 
synthetic intermediates between mesophyll and bundle-sheath cells. 

Oscillations of stomata and photosynthetic output parameters begin 
after abrupt changes of external parameters such as light, CO 2 , 02 , H 2 O and 
temperature (Walker 1992; Giersch 1994). However, there are also sponta- 
neous stomatal oscillations (Raschke 1975). In a seminal review, Giersch 
(1994) considers the various possible metabolic control points and dis- 
cusses two different mathematical minimal or “skeleton” models (for ra- 
tionale also see Hiitt and Liittge 2002) simulating oscillations. One model 
is based on a “two kinase-hypothesis” (see also Giersch et al. 1991), with 
phosphoglycerate and phosphoribulose kinases, and another one on a 
“coupling-hypothesis”, with the interdependence of photosynthetic elec- 
tron flow and photophosphorylation, and evidence is in favour of the first 
of these two hypotheses. 

Oscillations can be a first step on the route to (deterministic) chaos 
(Giersch 1994). A theoretical hydraulic model predicted the possibility of 
Hopf bifurcation in stomatal responses (Rand et al. 1981). Chaos can be 
studied in entrainment experiments where the responses to externally 
imposed oscillations of different frequencies are analyzed. In this way 
chaotic time series where obtained (1) of electrical membrane activity by 
rhythmic stimulation of Nitella cells (Hayashi et al. 1982; see Sect. 2.1), (2) 
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of glycolysis by rhythmic supply of glucose (Markus et al. 1984), and (3) of 
peroxidase-oxidase activity by rhythmic supply of oxygen (Olsen 1983; 
Olsen and Degn 1985). 

For similar studies of ultradian oscillations of stomata and photosynthe- 
sis external oscillations of O 2 (Brogardh and Johnsson 1974; Cardon et al. 
1994a) and especially light are used. Fast Fourier transform analysis (FFT) 
and resonant analysis approach (RAA) can then be applied to overt time 
series. RAA has been used to investigate kinetics of membrane-associated 
processes (Hansen 1978, Hansen et al. 1987, Vanselow et al. 1989). It 
unravels hidden oscillatory components and quantifies some parameters 
of feedback control systems. Thus, using extracellular bioelectric measure- 
ments of plant leaves under the influence of rhythmical illumination, 
Shabala (1997) was able to distinguish the light-induced bioelectric re- 
sponses of the photosynthetically active mesophyll, i.e., a resonant mode of 
a 1.5-min period linked to thylakoid-related processes as well as a feedback 
interaction involving the membrane potential and the cytosolic pH in 
mesophyll cells (resonant mode at a period of 8 min), from stomatal move- 
ments based on ion exchange of the guard cells (resonant mode for a period 
of 40 min). Generally, stomata-related oscillations of CO 2 assimilation 
(Siebke and Weis 1995a) appear to have much lower frequencies than 
photosynthetic CO 2 fixation (Walker et al. 1983; Laisk et al. 1991; Walker 
1992). The RAA-studies of Shabala et al. (1997a) revealed bifurcations of a 
period-doubling cascade, i.e., the Feigenbaum route to chaos, and other 
routes to chaos are also discussed. 

Synchronization/desynchronization of individual oscillators or leaf 
patches also plays a role in whether or not overall gas exchange of a leaf is 
seen to oscillate. Cardon et al. (1994b) have demonstrated patchy distribu- 
tion of stomatal oscillations in sunflower using chlorophyll fluorescence 
imaging and gas exchange techniques. Siebke and Weis (1995a, b) used 
chlorophyll fluorescence imaging. With the assimilation images obtained 
they observed that rapid changes in gas composition initiated oscillations 
in net gas exchange (H 2 O and CO 2 ) which changed to non-synchronous 
oscillations due to slight local variations in the period. Obviously oscilla- 
tions in net gas exchange died out, but assimilation persisted oscillating 
non-homogenously and non-synchronously over areas, patches or spots of 
the leaves (Siebke and Weis 1995a). Gas diffusion within the leaf may play 
a role (Cardon et al. 1994b; Siebke and Weis 1995a), but minor vein distri- 
bution and its mediation of transport processes (e.g., sugar export) also 
appears to be involved (Siebke and Weis 1995b). 
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2.5 Macroscopic Events of Turgor Movements: 

Leaves, Roots and Circumnutation 

Among the macroscopic events of turgor movements the most spectacular 
ones appear to be the pulvinar movements of leaflets of Desmodium moto- 
ricum with a period length in the minute range related to oscillations of 
apoplastic K + and H + and electrical activities (Antkowiak and Engelmann 
1989, 1995; Antkowiak et al. 1991; Antkowiak and Kirschfeld 1992; for 
circadian pulvinar movements see also last year’s review by Liittge 2002). 

Another intriguing phenomenon is the rhythmic firing movement (ca. 
7 min) of the stamens and style column of Stylidium graminifolium (Find- 
lay and Findlay 1981). 

Nutation of tendrils, shoots and roots is related to localized rhythmically 
changing turgor and perhaps also extension growth (see Sect. 2.6.). Nuta- 
tions may occur as pendulum-type movements restricted to one plane or 
as elliptical or circular movements around the longitudinal axis of growth 
(circumnutation). Oscillatory nutation is known since Charles Darwin 
(Darwin and Darwin 1880) and has been documented and reviewed regu- 
larly (Gradmann 1926; Johnsson and Heathcote 1973; Johnsson 1979; 
Brown 1993; Barlow et al. 1994). Nutational oscillations with period lengths 
in the range of about one to a few hours can be overlain by so-called 
micronutations with a five to ten times higher frequency and a very small 
amplitude of maximally 1 mm (Heathcote 1966). Oscillations with very 
different frequencies exist simultaneously. 

Rhythmic nutation was originally closely related to gravity responses in 
a gravity feedback model because with larger deviations from the vertical 
plumb-line the gravity vector becomes important (Israelsson and Johnsson 
1967; Johnsson and Heathcote 1973; Andersen 1976; Johnsson 1979). How- 
ever, studies under near weightlessness in space show that nutations persist 
independent of gravity although gravity may somewhat modulate them 
(Antonsen et al. 1995; Johnsson 1997), which is also supported by circum- 
nutation observed in the absence of the gravity-sensing root cap (Shabala 
and Newman 1997). This suggests operation of an internal oscillator as 
already assumed in a model emanating from Darwin and Darwin (1880) in 
contrast to a gravitropic overshoot model (Israelsson and Johnsson 1967). 

Hence, we must look for an endogenous oscillatory process to under- 
stand circumnutation. In roots this is clearly localized to the elongation 
zone (see Sect. 2.7) and correlated to H + flux oscillations, i.e., H + exchange 
between the cytosol and the cell wall (Shabala and Newman 1997). In 
twining shoots of Phaseolus vulgaris changes in the distribution of K + and 
putative related variations of water content and turgor in the cells of the 
bending zone were related to the movement (Badot et al. 1990; Care et al. 
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1998). In tendrils of Ipomoea extracellular electrical potential variations 
had a period in accordance with the nutations (Claire et al. 1985). 

In fact, for the nutational movements it is required that reactions travel 
around the plant organ and that there are waves of coupling/uncoupling of 
adjacent cells as for K + concentrations (Badot 1987; Badot et al. 1990), pH, 
Ca 2+ , phosphate and chloride (Badot 1987). This has been described by 
nonlinear reaction-diffusion equations on an annulus corresponding to a 
ring of cells in a cross section of the circumnutating organ (Lubkin 1994; 
Johnsson 1997). 

The role of Ca 2+ is central for these oscillations. Based on studies of 
tendrils and noting the importance of compartmentation and the involve- 
ment of Ca 2+ and calmodulin-regulated activities of the cellular cytoskele- 
ton (Engelberth et al. 1995), Weiler (1997) developed the model of an 
autonomous calcium oscillator in plants. 



2.6. Extension Growth 

Extension growth is associated with H + extrusion into the apoplast in the 
elongation zone. Elongation shows ultradian rhythms in both shoots (Kris- 
tie and Jolliffe 1986) and roots. 

Around roots there are oscillating electrical fields now known for almost 
fifty years (Scott 1957, 1962; Jenkinson and Scott 1961; Jenkinson 1962). 
They are strongest in the elongation zone (Toko et al. 1990, Shabala and 
Newman 1998). In bean roots they correlate in their period (6 to 12 min) 
with oscillations of elongation (Souda et al. 1990). There are oscillations of 
longer periods (4 and 7 h in roots of Trifolium repens , Macduff and Dhanoa 
1996; 1.5 h in corn roots, Shabala et al. 1997b) and shorter periods (7 min 
in corn roots, Shabala et al. 1997b), i.e., different lengths of time in the same 
tissue. The ionic basis is given by H + and Ca 2+ ions. H + and Ca 2+ oscillations 
are closely correlated (Shabala et al. 1997b, Shabala and Newman 1998). 
Shabala et al. (1997b) argue that the fast 7-min oscillations in corn roots are 
due to oscillations of the activity of electrogenic H + - ATPase, while the slow 
1.5-h oscillations originate from physical coupling of passive H + trans- 
porters. The spatial periodic pattern along roots disappears under anoxia 
and depends on aerobic metabolism most likely because of respiration-de- 
pendent H + -pumping (Yoshida et al. 1988; Hecks et al. 1992). 

Not only transport and electrical activities at the root surface show 
ultradian oscillations but also transport across the roots. Xylem exudation 
displays 8-min pulses, where Gd 3+ -inhibited Ca 2+ -channels are involved 
(Schwenke and Wagner 1992). 
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3 Theory: “What Can We Learn from Ultradian Rhythms?” 

In the sections above, describing phenomena of ultradian rhythms in 
plants, several keywords from the realm of nonlinear dynamics have come 
up, such as feedback, waves and nonlinear reaction-diffusion equations, 
synchronization, networks and many more. Routes to chaos with bistabil- 
ity, Hopf bifurcations and Feigenbaum series have been touched. A closer, 
more theoretical examination of these significant notations can tell us more 
about what we can generally learn from ultradian rhythms. We start from 
sorting the confusingly entangled topics related to these keywords. 

Whenever one attempts to formulate a mathematical model of a biologi- 
cal system and relates the dynamic variables (e.g. metabolite pools, popu- 
lation sizes, concentrations) via linear interactions (fluxes, migration, dif- 
fusion processes, etc.), the theoretical description takes place within the 
realm of linear systems theory. Only when nonlinear interactions [feedback 
loops, (auto)catalytic effects, enzymatic regulations, cooperation, etc.] are 
included one enters the domain of nonlinear dynamics. This step leads to 
a far more complex repertoire of dynamical features possible in the model 
system, in particular stable (limit cycle) oscillations and, under certain 
conditions, deterministic chaos. When a next stage of complexity, namely 
noise and fluctuations, is taken into account one is confronted with addi- 
tional dynamical effects. Noise can destroy order or, in certain nonlinear 
systems, it may create order. Noise-induced transitions and even noise-en- 
hanced structure formation have been explored in theoretical model sys- 
tems (Garcia-Ojalvo and Sancho 1999; Jung and Mayer-Kress 1995) as well 
as in nature (see, e.g., Beck et al. 2003 for an overview of recent activities in 
this field). Ever since the first theoretical characterization (Lorenz 1963) of 
deterministic chaos, attempts have been made to find such irregular, yet 
minimally generated (low-dimensional) dynamics in a natural system. 
While some of the early findings now have to be regarded as “random” (i.e. 
without a low-dimensional deterministic process behind it), in many cases 
we can now be certain that a given system, indeed, displays chaotic behav- 
iour (see, e.g., Mosekilde and Mouritsen 1995 and Kendall et al. 1997, and 
the biological examples in Strogatz 1994 and Kaplan and Glass 1995). In 
spite of such success, it remains unclear to what extent biological systems 
functionally exploit this behaviour (Olsen and Degn 1985; Kuznetsov and 
Rinaldi 1996; Glass 2001; see also Fox and Hill 2001 for an interesting new 
point of view in this debate). One could also imagine these forms of 
dynamics to be only futile additions to the physiologically important regu- 
lational properties arising from the dynamical complexity of efficient regu- 
lation. In fact, similar questions have been posed with respect to ultradian 
oscillations themselves in the early 1980s (see the discussion in Hess and 
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Markus 1985; Olsen and Degn 1985). At that time it seemed possible that 
feedback regulation necessary for generating a stable steady state leads to 
stable oscillations in a certain range in parameter space as a by-product, 
which has no physiological significance. The remarkable property of fre- 
quency-coding in calcium oscillations has much contributed to appreciat- 
ing ultradian oscillations as an important functional feature of biological 
systems (Woods et al. 1986; Meyer and Stryer 1991; Bock and Ackrill 1995). 
Indeed, it is well possible that even chaotic dynamics, if observed, serves a 
biological function. Adaptivity and plasticity may be important keywords, 
when discussing the value for a biological system to display chaos (Kuznet- 
sov and Rinaldi 1996; Sole et al. 1996;). 

Why are such questions important for biology? While noise and chaos 
often seem indistinguishable at a first glance, a major difference exists: For 
a chaotic system, one may have the hope to fully incorporate the dynamics 
in a (low-dimensional) deterministic mathematical model, whereas noise 
can only be accounted for by stochastic forces acting upon the deterministic 
part of the system. The discussion becomes even more difficult due to the 
complex interplay between noise and the dynamics of a natural system. 
Noise is a ubiquitous phenomenon and, in most cases, a disturbance to the 
measuring process. From the point of view of nonlinear dynamics, however, 
it may very well influence a system’s dynamics in a more sophisticated way 
than just by overshadowing it. It is possible that some biological process 
(e.g., the detection of a signal) is only functioning in an optimal way at some 
intermediate noise level present in the system (Douglass et al. 1993; 
Bezrukov and Vodyanoy 1995; Gammaitoni et al. 1998; Anishchenko et al. 
1999; Moss 2000). With lower or higher noise intensity the efficiency of the 
process will decrease. A schematic example of such a stochastic resonance 
is given in Fig. 1. This phenomenon is possible only in a nonlinear system 
(in our schematic example of Fig. 1, the threshold serves as nonlinearity). 
It is by now theoretically well understood and has been observed, together 
with the related effect of coherence resonance (Pikovsky and Kurths 1997), 
in a variety of natural and model systems (Longtin et al. 1991; Moss et al. 
1994; Bezrukov and Vodyanoy 1995; Lee et al. 1998; Beck et al. 2001). Thus, 
in the last years it has been appreciated that unveiling the effect of noise in 
a system can substantially contribute to the understanding of the processes 
within the system. This development has taken the focus away from chaos. 
Nevertheless, it is still true that finding a chaotic behaviour in a natural 
system gives a lot of information on how to formulate a mathematical 
model of it. 

Often one considers not only a single model system, but rather the 
collective behaviour of many identical (or almost identical) copies of such 
a system. Such investigations are situated in the vast domain of self-organ- 
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Fig. 1. Schematic example of a system displaying stochastic resonance. A signal s(t) consist- 
ing of a sine wave and additive noise enters some detection mechanism with a threshold; a, 
c and e denote different noise intensities a . At each point in time, the signal is translated 
into “yes” (y) or “no” (n), depending upon whether the signal reaches above the threshold 
or not (b, d and f). One sees that the sequence of maxima present in the pure signal (i.e. in 
the sinusoidal oscillation without noise) is best reproduced by this device at an intermediate 
noise intensity. This visual impression is quantitatively supported by the (cross)correlation 
coefficient of the pure signal with the detected signal as a function of the noise intensity a 
(right-hand side). In a natural system, the more frequent case is that a (almost) pure signal 
enters a system with a noisy threshold. Mathematically, these two cases are equivalent. 
(Adapted from Hiitt and Liittge 2002) 
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izing complex systems . One can then study the effect of noise on the 
dynamics of individual nonlinear elements and on the coupling between 
these elements in the formation of spatiotemporal patterns. 

A yet more complicated setting emerges, when the concept of networks 
enters the description lined out so far. There are basically two distinct 
complications arising from the network aspect of interacting systems. First, 
the individual model system designed with the methods from nonlinear 
dynamics may, in a realistic biological situation, only be a part of a meta- 
system far more complicated than the isolated piece for which a mathemati- 
cal description has been attempted. Second, the step from the dynamics of 
the individual model system to the behaviour of many identical (or similar) 
copies may require arrangement of the elements in a more complicated way 
than, e.g., on a regular grid. And this arrangement (the “architecture” at the 
system) may strongly influence the dynamical properties of the system (e.g., 
whether synchronization takes place, or how the system responds to exter- 
nal perturbations). 

In light of this general background, one can now formulate the theoreti- 
cal aims of investigating ultradian rhythms. From our point of view, three 
general questions are in the centre of interest in such a theoretical analysis. 
First: time scales . Which properties of the system determine the time scales 
present in the system? Can one find “universal time scales” (or ranges of 
time scales) shared by all or most systems capable of generating ultradian 
rhythms? Under which rules do time scales coexist? What observable prop- 
erties are the result of interference of such coexisting time scales? Second: 
responses. How robust are ultradian rhythms with respect to external per- 
turbations (e.g. external periodic driving or external noise)? Is such a 
“robustness”, if it exists, a system property following immediately from the 
physiological realization of ultradian rhythms? Can a lack of robustness be 
seen as a key difference to circadian rhythms? Third: relation to circadian 
oscillations . Can one construct circadian rhythms (or certain forms of 
circadian oscillation) from ultradian rhythms (for example in the sense of 
producing a robust circadian oscillation by interference of many fast - 
ultradian - oscillations)? Some of these questions can be elucidated using 
simple mathematical model systems. This use of minimal models fre- 
quently applied to biological systems (see, e.g., Hiitt and Liittge 2002) has, 
however, a certain complication in the case of ultradian rhythms. As 
pointed out in sections 1 and 4, evolutionary pressure seems to act less 
strongly (or less coherently) on ultradian rhythms than, e.g. on circadian 
oscillations. One, therefore, expects a much wider range of physiological 
realizations. One of the key strengths of mathematical modelling, its unify- 
ing power, consequently seems not to apply. Indeed, comparatively small 
differences between the observed dynamics become much more important, 




High Frequency or Ultradian Rhythms in Plants 



249 



as they may provide the key for distinguishing between the possible physi- 
ological mechanisms. Such small differences may be given by frequency 
ranges, curve forms, phase shifts between dynamic variables, possible 
additional forms of dynamics and their basins of attractions. Nevertheless, 
from a theoretical point of view the discussion of “minimality” is still an 
important issue in the study of ultradian rhythms. Using simple model 
systems one can see how a generic oscillator responds to a change of internal 
properties or external conditions. 

How are ultradian rhythms related to the theoretical framework outlined 
above? In our opinion the key to putting this biological phenomenon into 
such a global theoretical perspective lies in the term “robustness”. 

When methods from nonlinear dynamics are applied to model or ana- 
lyze ultradian rhythms an important topic is that of biological relevance of 
complex dynamical phenomena. In Section 2 we described a wide variety 
of empirical findings related to such complex dynamics. In the following, 
we will demonstrate that the notion of robustness is crucial for evaluating 
the relevance of such findings for biological systems in vivo. We have seen 
that a considerable amount of empirical investigations of ultradian 
rhythms focuses intensely on the notions and concepts of nonlinear dynam- 
ics. In particular the phenomenon of deterministic chaos and the different 
transitions from regular (e.g., oscillatory) to chaotic dynamics (the “routes 
to chaos”) are frequently discussed. With a certain minimal model of a 
period-doubling route to chaos, the logistic finite-difference equation (or 
“logistic map”) we will demonstrate what we understand by robustness. The 
most important future observation will be, how the bifurcation pattern 
changes when two such systems are interacting, and what properties of this 
route to chaos are robust with respect to such an insertion of additional 
degrees of freedom. 

Biochemical reactions rarely occur as isolated phenomena. They rather 
are part of an entire network of interacting reaction channels and signalling 
pathways. This particular perspective, that the network structure of a 
biological system or biochemical process regulates system properties, has 
emerged in the last few years in the theoretical literature (Watts and 
Strogatz 1998; Albert and Barabasi 2002; Ravasz and Barabasi 2003) also 
because first full analyses of the topology of a real biological network has 
been performed (Jeong et al. 2000). One remarkable result of these recent 
investigations is that in contrast to theoretical expectations (see, e.g., the 
pioneering work of Kauffman 1974) only very few cases of deterministic 
chaos have been observed in real biological systems under in vivo condi- 
tions (i.e., when the entire network of interacting biological and biochemi- 
cal processes is taken into account; see Fox and Hill 2001). 
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The concept of including additional degrees of freedom always comes 
into play, when the system (either in an experiment or represented in terms 
of a mathematical model) should not be considered as an isolated entity but 
rather as a subsystem within a complex network of systems permanently in 
interaction with each other. Such an additional layer of complexity arises 
in biological systems in two cases: First, when instead of the system at hand 
the spatial arrangement of many such systems becomes important. In this 
case, the synchronization and desynchronization of these systems have to 
be taken into account, when discussing the overt (integrated) dynamics. 
Instead of purely temporal dynamics, one then has to consider the spa- 
tiotemporal dynamics of the full system (Rascher et al. 2001; Beck et al. 
2003). Second, when the system under consideration is part of a larger 
system interacting with the subsystem. In this case the network properties 
will influence the dynamics produced by the subsystem (Hartwell et al. 
1999; Ravasz and Barabasi 2003). 

Sometimes the influences of the system as a whole on a subsystem are 
acting on very different time scales than relevant for the discussion of the 
subsystem. In such cases the effect of the whole system on the subsystem 
can be approximated either by noise entering the subsystem (when the 
influence is restricted to fast dynamics) or as a drift in the subsystem’s 
parameters (when the influence of the full system upon the subsystem 
consists of slow dynamics). 

In all these cases the comparability of the model system (given either in 
terms of an experimentally isolated copy of a certain part of the biological 
system or as a mathematical model describing such a subsystem) with the 
real biological situation has to be studied carefully. 

As already mentioned above, for estimating the biological relevance of 
a route to chaos it is instructive to look at one particular case, where 
robustness of a dynamical phenomenon with respect to increasing the 
complexity of the system is particularly obvious, namely the famous period- 
doubling route to deterministic chaos. To this end we extend the usual 
logistic finite-difference equation (or “logistic map”) 

x t + 1 = fx (x t ) =R X Xt( 1 -X t ) ( 1 ) 

to a system consisting of two coupled equations 

Xt+ 1 — fx (xt ) + D [fy (yt) - fx (*t)] 

yt+i = fx (yt) + d [f x (x t ) - fy (yd) 
or, more explicitly, 

x t+ i = (1 -D) R x x t (l-x t ) + DR y y t (1 -y t ) 
yt+i = (1 - A) Ry ( 1 —)’i) 3“ DRxXt ( 1 ~Xt) 



( 3 ) 
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The coupling terms (i.e. all terms proportional to the coupling constant D) 
could be named “migration terms” when keeping in mind the original 
biological motivation of the logistic map from population biology. The 
dynamics of the simple logistic map, Eq. (1), as well as some remarks of its 
role in plant biology can be found in (Hiitt and Liittge 2002). Due to the rich 
dynamics produced by Eq. (1) it is an excellent system for getting used to 
some of the concepts of nonlinear dynamics. Here, we will only very briefly 
mention its most important properties. For a more thorough review on this 
particular equation, see May (1976), Kaplan and Glass (1995) and Hiitt 
(2001). Equation (1) contains a single control parameter, R x , which regu- 
lates the general properties of the time evolution of the dynamical variable 
Xu In order to fully determine the time course of xu the value of R x has to 
be specified, as well as a starting value (the “initial condition”) xo. One can 
now study the dynamic behaviour of Eq. (1) under variation of R x . The 
entire repertoire of possible time evolutions can be condensed into a single 
representation, the bifurcation diagram , which is shown in Fig. 2, together 
with three examples of time series. At low R x the system has a stable fixed 
point, i.e. iteration transports all starting points to a value xf, where it 
remains with continuing iteration. The value of xf only depends slightly on 
R x . At a certain value of R x , however, the fixed point becomes unstable and 
a period-2 cycle takes over. Now the size (or amplitude) of the oscillation 
depends on the distance from this bifurcation point: As R x increases, the 
amplitude becomes larger. At a further increase of R x a series of period- 
doubling bifurcations appearing at ever smaller intervals in R x takes the 
system via a period-4 cycle, a period-8 cycle, etc., to an irregular behaviour 
called deterministic chaos. In this regime small differences in the initial 
value lead after a few time steps to completely different (i.e. uncorrelated) 
time series. 

Starting from this well-known model system of nonlinear dynamics, we 
can now formulate our thought experiment. Let us assume that Eq. (1) 
represents, to a certain extent a model of an ultradian oscillator: In a certain 
range of the control parameter R x a stable oscillation is observed, followed, 
as R x is increased, by a period-doubling route to deterministic chaos. Let 
us further assume that some upper and lower value for R x exists, which 
delimit the biologically relevant parameter range. Values of R x outside this 
range are assumed to be impossible or irrelevant for a biological system. 
Such a limitation is frequently encountered, when a description of a bio- 
logical situation is attempted. In our case, this range serves as a reference 
point for visualizing the effect of coupling on the dynamics of the individual 
systems. Figure 3 shows some bifurcation diagrams for two coupled logistic 
equations given by Eqs. (2) and (3). Starting from the original (uncoupled) 
case ( upper left diagram ), coupling is introduced and the difference be- 
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Fig. 2. Study of the dynamics produced by the logistic map, Eq. (1), for different values of 
the control parameter R x . The most important information on the R x dependence of the 
time development of x t can be summarized in a bifurcation diagram, where the possible 
values of xt are shown as a function of the control parameter R x . In the bifurcation diagram 
of the logistic map, one can immediately see the various period-doubling bifurcations, which 
dominate the dynamics over a wide range of the control parameter R x and ultimately lead 
to deterministic chaos, when R x is further increased. Time series are shown for three values 
of R X) namely R x = 3.3 (stable oscillation called period-2 cycle), R x = 3.5 (period-4 cycle) and 
R x =3.72 (deterministic chaos) 
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Fig. 3. Bifurcation diagrams of two coupled logistic maps, Eqs. (2) and (3), for different values 
of the bifurcation parameter difference Ar, coupling D and initial value difference Ao. 
Choices of parameter values are indicated as insets in each bifurcation diagram. The upper 
left diagram (uncoupled case; D= 0) corresponds to the bifurcation diagram shown in Fig. 
2. The vertical dashed lines show the assumed biologically motivated restrictions to the 
parameter range 
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tween initial values, Ao=yo-xo , is increased ( left column ), as well as the 
difference between the bifurcation parameters, AR=R y -R x . It is seen that the 
bifurcation diagram undergoes drastic changes due to coupling. In particu- 
lar, the ranges in R Xy where chaotic behaviour is observed, are in some cases 
highly suppressed. This is seen most strikingly, when one assumes biologi- 
cally motivated restrictions to the parameter range, as outlined above. 
Figure 3 contains such a (fictitious) parameter range, which for D = 0 con- 
tains a large portion of the chaotic regime. For small coupling and interme- 
diate Ar, however, all remaining chaotic behaviour lies outside this range 
( lower right diagram ). 

The case of coupled logistic equations, Eqs. (2) and (3), together with 
some implications of coupling for our understanding of biological phenom- 
ena, have been discussed by Lloyd (1995), Kendall and Fox (1998) and Sole 
and Gamarra (1998). We consider them here as a very simple model 
representation of some of the striking phenomena of nonlinear dynamics 
easier to discuss and richer in dynamics than their continuous counter- 
parts. Notwithstanding, the differential equations, finite-difference equa- 
tions, in their own right, are an interesting starting point for both, mathe- 
matical simulation and analysis in biology (see Hiitt et al. (2002) for an 
example from plant biology). 

Both from the point of view of modelling as well as from data analysis 
the general concepts of nonlinear dynamics provide some guidelines (as 
well as some caveats!) for addressing the questions outlined above. Cur- 
rently, one observes a focus on the study of isolated in vitro systems and 
relating these findings to the large catalogue of behaviours put forward by 
theoretical studies in nonlinear dynamics. In particular, the step, via simple 
interacting systems, towards complex biochemical and biophysical net- 
works may eventually help to develop a clear and systematic understanding 
of the minimal realizations and the various biological implementations of 
ultradian rhythms. 



4 Functions? 

“In search for a biological hour” Koukkari et al. (1997) list a large number 
of ultradian oscillations of bacteria, amoebae, yeast, higher plants and 
mammals with periods ranging from 30 to 240 min and averaging 95 min. 
However, an analogy to the hour appears to be a curiosity rather than a 
functional trait. The hour is not a natural environmental period. Even in 
the discussion of circadian rhythmicity, we cannot be totally sure if oscil- 
lations are physiologically important and provide increased fitness or only 
a by-product of time constants of endogenous processes evolved under 
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continuous entrainment by the natural diurnal cycle (Liittge 2002). Thus, 
several authors conclude that ultradian oscillations are merely side effects 
of regulatory systems (Giersch 1994; Siebke and Weis 1995b). 

Conversely, various cell physiological functions of ultradian oscillations 
have been suggested. They may be crucial for the synchronization and 
co-ordination of spatially separated physiological processes (Lloyd and 
Kippert 1993) as well as for the separation of incompatible processes within 
the same cellular compartment (Rapp 1987). In enzyme reactions involving 
potentially toxic reactive intermediates, oscillatory dynamics lower the 
average concentration of toxic compounds such as H 2 O 2 , superoxide and 
other reactive oxygen species in the peroxidase-oxidase reaction (see Sect. 
2.2; Hauser et al. 2001). Oscillations increase the thermodynamic efficiency 
of metabolism lowering energy dissipation (Lazar and Ross 1990). Ca 2+ 
oscillations may transmit stimulus-specific information to a downstream 
signalling machinery involving Ca 2+ -dependent protein phosphorylation 
(McAinsh et al. 1995). In general, oscillations may encode information in 
signal transduction chains or networks (Berridge et al. 1998, Shabala et al. 
2001), with, e.g., Ca 2+ , or also the H 2 O 2 signal emerging from the peroxi- 
dase-oxidase reaction, where frequency-encoded regulation is more accu- 
rate than amplitude-encoded regulation (Moller et al. 1998). That ultradian 
oscillations may represent a frequency-encoding mechanism for environ- 
mental information (Shabala et al. 2001) maybe also supported by Fourier 
transform analysis of natural light fluctuations in Tasmania, which revealed 
frequency peaks of power spectra corresponding to periods of 3, 6 and 
14 min, i.e., in the same range as in ultradian oscillations of photosynthesis 
(Shabala et al. 1997a). This is highly interesting in an evolutionary context. 
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Nutritional Aspects of Ci Unit Metabolism 
in Heterotrophic Tissues and Organisms 
(Excluding Prokaryotes) Under Particular Respect 
to Dissolved Organic Nitrogen (DON) 
from the Environment 

Hartmut Gimmler and Wolfram Hartung 



1 Introduction 

One-carbon (Ci unit) metabolism is essential to all eukaryotes and 
prokaryotes. In plants and fungi it supplies the Ci units required for the 
synthesis of amino acids, proteins, nucleic acids (purines), some vitamins, 
and methylated compounds such as lignin, alkaloids or betaines (Fig. 1). 
Thus it is involved in such different physiological aspects such as regulation 
and differentiation via DNA and vitamin metabolism, cell wall formation 
and extension (lignin and pectin), osmoregulation (betaines), secondary 
plant metabolism (alkaloids), amino acid metabolism, internal and exter- 
nal detoxification (cyanides) or utilization of N- and C-containing solutes 
abundant in the external environment. Additionally it plays an important 
role in photorespiration of photosynthetic C3 plants. 

Serine is generally accepted as the major one-carbon donor in Ci meta- 
bolism, but also glycine and formate; in some cases also cyanides can 
provide one-carbon units. In most cases the Ci transfer is mediated by the 
co-factor tetrahydrofolate (THF). THF contains pteridine, p- aminoben- 
zoate and glutamate entities and therefore its chemical name is tetrahydro- 
folate polyglutamate (FUPteGlun). The synthesis of this compound starts 
with the generation of pteridine precursors from GTP and is described in 
detail e.g. by Cossins (1987), Cossins and Chen (1997) and Rebeille and 
Douce (1999). Recently it was found that in few cases at least in bacteria also 
tetrahydromethanopterin (H 4 MPT) can function as carrier for Ci units 
(Grabarse et al. 1999; Pomper et al. 1999; Chistoserdova et al. 2000; Maden 
2000 ). 

In principle, in Ci metabolism compounds such as glycine, serine or 
formate (in bacteria and fungi also methane or methanol), by catabolic 
reactions generate various specific derivatives of THF (Fig. 1 ). Sub- 
sequently these derivatives are interconverted enzymatically between dif- 
ferent oxidation states. 10-Formyl-THF is the highest oxidized derivative, 
5-methyl-THF the most reduced derivative, 5,10-methenyl-THF and 5,10 
methylene-THF having intermediate redox states. In anabolic reactions the 
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Fig. 1. Summarizing sketch of reactions of folate biosynthesis and Ci unit metabolism in 
higher plants and fungi as covered in this review. Numbers refer to enzymes cited in Table 
1 (column 1) 
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Ci units are withdrawn from the various THF derivatives, yielding, e.g., 
purines, panthotenate, thymilidate or methionine. If Ci units originate 
from formate, also serine can be synthesized (Igambierdiev et al. 1999; 
Wingler et al. 1999). The scheme in Fig. 1 neglects the intracellular com- 
partmentation of Ci metabolism in plastids, mitochondria, peroxisomes 
and the cytosol, which has been described in detail (Cossins 1987, McNeil 
et al. 1996, Neuburger et al. 1996, Kastanos et al. 1997, Chen et al. 1997, 
Atkins et al. 1997, Bourguignon et al. 1999, Rebeille and Douce 1999 and 
Hanson et al. 2000). 

Although Ci metabolism is of crucial importance for all organisms, much 
of its details are unknown, because this metabolic pathway is difficult to 
investigate: The involved enzymes (compare Fig. 1 and Table 2) can be of 
low abundance or be inducible. They may exist in several iso-forms or 
multifunctional units. The key intermediates, the Ci-substituted THFs, 
occur only in pmolar concentrations in the cytosol and are labile, causing 
difficulties during quantitative determination. For this reason, recent pro- 
gress made in the elucidation of Ci metabolism came mainly through two 
new techniques: (1) the non-invasive 13 C-NMR-technique, which allows to 
determine in vivo fluxes of carbon through Ci metabolism (for summary 
see Mouillon et al. 1999; Ratcliffe and Shachar-Hill 2001: higher plants; 
Pasternack et al. 1992; Pasternack et al. 1994a,b, 1996; Appling et al. 1997; 
Maaheimo et al. 2001: fungi) and (2) genomic-driven approaches, which 
means identification of genes by the use of mutants with specific deletions 
(Barlowe and Appling 1990a,b; Maier-Greiner et al. 1991; Wang and Vanet- 
ten. 1992; Cluness et al. 1993; Song and Rabinowitz, 1993; Wahls et al. 1993; 
West et al. 1993; Brown et al. 1995; Kirksey and Appling 1996; Bird and 
Bradshaw 1997; Todd et al. 1997; Parpinello et al. 1998; Toyama et al. 1998; 
Rover et al. 1999; Hanson et al. 2000; Ravanel et al. 2001; Hanson and 
Shahar-Hill 2002; Nikiforov et al. 2002; compare also Table 2). 

The focus of this contribution is on physiological aspects of Ci metabo- 
lism in heterotrophic tissues of plants and in fungi. It excludes, with few 
exceptions, prokaryotes. Special attention is directed to (1) nutritional 
aspects of Ci metabolism and (2) its role in stress physiology. Beside a 
survey, it will show by two case studies that Ci metabolism is of essential 
importance for plants and fungi suffering from nitrogen deficiency and that 
by proper modulation of Ci metabolism plants and fungi can acclimatize 
to uptake and metabolisation of those organic N-containing compounds 
prevailing in their environment. Extreme stress conditions produce high 
selection pressure on organisms. This applies also to the use nutritional 
sources from the environment. Organisms which are able to use N-contain- 
ing solutes from the soil solution, which cannot be exploited by other 
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organisms competing for the same habitat, will finally dominate this eco- 
logical niche. 

Because of this special focus, another interesting aspect of Ci metabolism 
in plants, e.g. its function in photorespiration (Gardestrom et al. 1985; 
Besson et al. 1993; Neuburger et al. 1996; Douce and Neuburger 1999; 
Wingler et al. 1999) had to be omitted. It is also not the aim of this paper to 
review the biochemistry and the underlying molecular biology of Ci meta- 
bolism. Instead, we refer to the excellent reviews which have been written 
recently covering various biochemical aspects of Ci metabolism (Appling 
1991; Cossins and Chen 1997; Mouillon et al. 1999; Rebeille and Douce 1999; 
Hanson et al. 2000). 

To the best of our knowledge no corresponding articles on Ci metabo- 
lism have been published so far in Progress of Botany and we neither found 
the terms c< carbon-one metabolism ” nor “tetrahydrofolic acid” in the sub- 
ject indexes of the last twenty volumes of this series. Thus, we hope to close 
a “gap” by this review, even though the latter is not a general one, but prefers 
to put the emphasis on some selected physiological topics of Ci metabolism 
and application to two case studies. 



2 Higher Plants 

2.1 Glycine and Serine as Potential Sources of Nitrogen 
in Higher Plants: General Aspects 

Dissolved organic nitrogen (DON) in the soil provides an alternative to the 
usual inorganic forms in a wide range of plants and circumstances 
(Nasholm and Persson 2001). When nitrogen mineralisation is impaired, 
the concentration of inorganic nitrogen in the soil solution can be very low, 
and under such conditions organic nitrogen, often in the form of amino 
acids, maybe the major nitrogenous compounds available to the roots. This 
process has been shown to be an important factor in the nitrogen nutrition 
of plants in alpine and arctic habitats (Chapin et al. 1993; Kielland 1994; 
Lipson and Monson 1998; Raab et al. 1999), as well as for plants in heath- 
lands (Schmidt and Stewart 1997), boreal forests (Nasholm et al. 1998), 
coastal marsh (Henry and Jeffries 2002), grassland communities (Falken- 
gren-Grerup et al. 2000; Streeter et al. 2000; Thornton 2001) and the aquatic 
resurrection plant Chamaegigas intrepidus (Heilmeier and Hartung 2001). 
Moreover utilization of glycine has been demonstrated in agriculturally 
important species grown under field conditions, suggesting that organic 
nitrogen may be a more important source of nitrogen for plants under 
cultivation than previously thought (Nasholm et al. 2000, 2001). 
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The observation that organic nitrogen compounds, such as amino acids, 
can be absorbed by roots under field conditions at rates that can make a 
substantial contribution to the nitrogen requirement of plants, is good 
evidence for the involvement of such compounds in plant nitrogen nutri- 
tion. The existence of a range of amino acid transporters in plant roots 
suggests a likely mechanism (Fischer et al. 1998), and indeed the use of 
dual-labelled amino acids, for example glycine labelled with both 13 C and 
15 N (Nasholm et al. 1998, 2001), followed by the analysis of plant extracts 
using GC-MS provides a convenient method for investigating the contribu- 
tion of direct and indirect uptake to the intracellular nitrogen pool. Dem- 
onstrating the quantitative significance of the uptake process has been the 
priority in most of these studies, with the result that the subsequent meta- 
bolism of the absorbed nitrogenous compounds has received much less 
attention, even though the pathways for their utilization may be unclear. 
Thus while the uptake of glycine has been shown to be a significant source 
of plant nitrogen in many of these studies, the extent to which the glycine 
decarboxylase complex (GDC; EC 2.1.2.10; compare Fig. 1) might comple- 
ment the action of aminotransferases in the subsequent metabolism of the 
glycine has been investigated infrequently. In one such study, it was con- 
cluded that glycine was metabolized in the roots and cluster roots oiHakea 
seedlings via aminotransferase activity (Schmidt and Stewart 1999). This 
conclusion was consistent with earlier observations on the low GDC activity 
in pea root apices (Walton and Woolhouse 1986), but more recent data 
suggests that glycine metabolism via GDC in heterotrophic tissues may 
actually occur quite readily (Mouillon et al. 1999). 

It has been argued on the basis of the results obtained with a heterotro- 
phic Acer pseudoplatanus cell culture that metabolism through the GDC 
pathway must be an essential feature of heterotrophic plant metabolism 
(Mouillon et al. 1999), despite the generally low levels of extractable GDC 
activity in such tissues (Walton and Woolhouse 1986; Bourguignon et al. 
1993). 



2.2 Glycine and Serine as Potential Sources of Nitrogen 

for the Aquatic Resurrection Plant Chamaegigas intrepidus 

Recently data of in vivo NMR studies with roots of Zea mays and Chamaegi- 
gas intrepidus that have been incubated with 15 N- and 13 C-labelled glycine 
have been published (Hartung and Ratcliffe 2002). The experiments with 
[ 15 N] glycine provided direct evidence for the release of [ 15 N] ammonium, 
and its subsequent incorporation into glutamine and glutamate via the 
GS/GOGAT (glutamine synthetase/glutamine 2-oxoglutarat-aminotrans- 
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ferase) path- way. The first explanation for the ammonium release is that 
glycine is metabolized by glycine decarboxylase (GDC), and indeed this 
explanation is supported by both experiments using inhibitors of GDC and 
[2- 13 C] glycine. More complicated pathways for the release of ammonium 
can be envisaged, which however seem to be quite unlikely as discussed in 
detail by the authors. Secondly, the experiments with [2- 13 C]glycine con- 
firmed the involvement of GDC and showed that serine hydroxymethyl- 
transferase (SHMT), both alone and in concert with GDC, was responsible 
for the conversion of glycine to serine. These conclusions can be deduced 
from the labelling patterns observed for the serine isotopomers in the 
13 C-NMR spectrum: [2- 13 C]serine is formed by the SHMT-mediated reac- 
tion of unlabelled CH 2 -THF and [2- 13 C] glycine (compare Fig. 1); [2,3- 
13 C] serine arises from the reaction of [ 13 C] -labelled CH 2 -THF, generated 
by the action of GDC on [2- 13 C]glycine, and [2- 13 C] glycine; and [3- 13 C]gly- 
cine arises from reaction of [ 13 C] -labelled CH 2 -THF and unlabelled glycine. 
The characteristic signals of the three isotopomers are readily identifiable 
and they provide unequivocal evidence for the involvement of both GDC 
and SHMT in the metabolism of glycine. 

Thus it can be seen that GDC and SHMT are directly involved in the 
metabolism of glycine by C. intrepidus and maize root tissues, and this 
provides further support for the emerging view that GDC plays an essential 
part in heterotrophic metabolism (Mouillon et al. 1999). This conclusion is 
also significant in relation to the nitrogen nutrition of a range of plants in 
their natural habitat, since glycine is often the most or second-most abun- 
dant nitrogen source that supports the plants. The metabolism of glycine 
can act as a direct source of ammonium for the GS/GOGAT pathway and 
thus define the pathway that permits plants such as C. intrepidus to utilize 
glycine as a nitrogen source. Labelling experiments with [2- 13 C]- and 
[ 15 N]glycine point to the involvement of GDC and SHMT in the metabolism 
of glycine by the root tissues of C. intrepidus and maize. The increasing 
awareness of the contribution that organic nitrogen forms make to plant 
nitrogen nutrition in a wide range of habitats, including ones of agricultural 
importance, suggest that more detailed investigations of the pathways of 
both glycine and serine metabolism would be worthwhile. 
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3 Eukaryotic Micro organisms 

3.1 General Physiological Aspects of Ci Metabolism 
in Eukaryotic Micro-organisms 

In fungi the three-carbon of serine is generally the major one-carbon donor 
of Ci metabolism (Pasternack et al. 1994a). Occasionally glycine and for- 
mate can substitute for serine. If formate is the one-carbon donor, assimi- 
lation is primarily cytoplasmic (Pasternack et al. 1992), whereas when 
serine serves as Ci donor, metabolism occurs preferentially via the mito- 
chondria. The interconversion of serine and glycine in fungal Ci metabo- 
lism is catalyzed by serine hydroxymethyltransferase (SHMT) (EC 2. 1.2.1) 
following Eq. (1), 

Serine + H 4 PteGlu„ < serine 5,10-CH 2 -H 4 PteGlu„+glycine 

( 1 ) 

where H 4 PteGlu n is 5, 6, 7, 8 tetrahydropteropolyl-y-glutamate (= THF) and 
n the number of glutamate moieties. SHMT is localized in the cytosol as well 
as in mitochondria of fungi (Cossins 1987; compare Table 1). 

In yeast the formation of 10-HCO-H 4 PteGlu n (10-formyl-THF) and its 
conversion to 5,10-CH2-PteGlu n (5,10-methylene-THF) are catalyzed by a 
trifunctional peptide called Ci tetrahydrofolate synthetase (Cossins and 
Chen 1997; compare Table 1). This protein has three distinct domains for 

1. 10-HCO-H 4 PteGlu synthetase (EC 6.3.4.3) (Eq. 2), 

Formate+H4PteGlun 10-HCO-H 4 PteGlu n +ADP+Pi 

( 2 ) 

2. 5 , 10 -CH 2 -H 4 PteGlu cyclohydrolase (EC 3.5.4.9 ) (Eq. 3), 

10-CHO-H 4 PteGlun+H + < 5 - 10metfen > ,i - rH F - c > ,do ' , >' droias S5 > 10-CH + -H 4 PteGlu n + 

H+H 2 0 (3) 

and 

3. 5,10-CH2-H 4 PteGlu dehydrogenase (EC 1.5.1.5) (Eq. 4). 

5,1 0-CH + -H 4 PteGlu n +NADPH < ^°- methen y l - THF ~ deh y dr ^ enas % 5 , i O-CH 2 -H 4 

PteGlun+NADP 

(4) 

The Ci -tetrahydrofolate synthetase is by no means the only multifunctional 
enzyme complex in fungal Ci metabolism. Also the fungal glycine decar- 
boxylase (GCV) and the thymilidate synthetase (TS) are catalyzed by mul- 
tifunctional peptids (Table 1). 




Table 1. Biochemical studies of microbial enzymes related to Ci metabolism (compare Fig. 2; Cossins and Chen, 1997; Hanson and Shachar-Hill 
2002). Information refers mainly to fungi, but some data on bacteria, protozoa and algae are also included. For molecular biological data see Table 
2. Numbers in column 1 refer to Figs. 1 and 5 
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Table 1 summarizes recent literature of biochemical studies of enzymes 
involved in Ci metabolism and folate biosynthesis. Many genes of these 
enzymes haven been identified and sequenced lately (Table 2). 



3.2 Amides and Nitriles as Nitrogen Sources During the Adaptation 
of Fungi to Extreme Environmental Conditions 

Fe- or Co-containing nitrile-degrading enzymes (Kobayashi and Shimizu 
1998) are widely dispersed in prokaryotes and lower orders of eukaryotes, 
but not in higher plants (Miller and Conn 1980). By means of hydrolyases 
(nitrile-hydratases, NHases) some bacteria and fungi are able to detoxify 
cyanides (nitriles). Cyanide hydratase, e.g. (EC 4.2.1.66 = formamide hy- 
drolyase, FHL) converts cyanide to formamide according to 

HCN+H20 ^ornanidehydro-lyas^ (5) 

The resulting amides may be subsequently split by amidases into organic 
acids and ammonia. For instance, formamide is converted to formate and 
NH 3 according to 

HCONH2+H2O <- Form ^ - a5 V HCOOH2+NH3 ( 6 ) 

NH 3 is utilized as nitrogen source by conventional pathways via glutamate. 
A formate dehydrogenase (EC 1.2. 1.2) may liberate CO 2 from formate 

HCOOH+NAD < Form ^^^ roggM - a5e > CO 2 +NADH (7) 

In such a case formamide can serve only as N-source but not as C-source. 
However, formate may be incorporated into N 10 -formyl tetrahydrofolate 
(THF) (Rebeille and Douce 1999; Rhodes et al. 1999) according to Eq. (2) 
and subsequently used e.g. for methionine synthesis from homocysteine. 
In such a case formamide is used both as C- and N-source. NADH of Eq. 7 
can be used for the formation of ATP. 

The two-step degradation of toxic cyanides to potentially useful sub- 
strates occurs in both bacteria (Miller and Gray 1982) and fungi (Rissler and 
Millar 1977; van der Walt et al. 1993; Brown et al.1995; Dumestre et al. 
1997a, b). In yeast assimilation of amides can be used for systematical 
identification: Testing hydrolysis, e.g., of 23 different amides by 500 differ- 
ent yeasts and yeast-like strains demonstrated that 10 of these amides are 
sufficient to distinguish seven genera and 19 species (Mira et al. 1955). From 
the evolutionary view the first step may present an adaptation to the 
chemical environment in early history of the biosphere. However, also a 
correlation between FHL activity and the phytopathogenicity of some fungi 
to cyanogenic plants was observed (Fry and Evans 1977; Wang et al. 1992). 
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The occurrence of amidases may reflect a secondary adaptation, following 
the evolution of NHases. It allows the utilization of the end-products of 
NHase activities. Since nitrogen is often the limiting element in growth the 
use of any amides present in the environment is of evolutionary advantage. 
Thus, it is not surprising that some microbes are able to live with cyanides 
as the sole nitrogen source (Maier-Greiner et al. 1991; Kunz et al. 1994; 
Linardi et al. 1996; Dumestre et al. 1997a, b; Barclay et al. 1998a; Nauter et 
al. 1998; Gimmler et al. 2002) and additionally as the sole C-source (Miller 
and Gray 1982; Narayanasamy et al. 1990; Maier-Greiner et al. 1991; Dumes- 
tre et al. 1997a, b; Nauter et al. 1998). 

Table 1 compiles progress made in previous years in enzymatic studies 
of fungal Ci metabolism (see also Fig. 1), whereas the literature on identi- 
fication and sequencing of genes of such enzymes is listed in Table 2. For 
information on corresponding enzymes of higher plants see Hanson et al. 
(2000). In Table 3 recent studies on microbial utilization of selected organic 
substrates related to Ci metabolism are summarized with emphasis on (1) 
utilization of DON under N-limiting conditions, (2) detoxification of sol- 
utes of toxic potentials in organisms, (3) compounds of environmental 
concern such as climate gases, (4) mutualistic relations between different 
organisms. 



3.3 Amides and Nitriles as Nitrogen Sources 

in the Extreme Acid-Tolerant Fungus Bispora sp. 

Considering the fact that many polluted soils, industrial wastes and efflu- 
ents contain considerable amount of various toxic cyanides, biodegrada- 
tion of nitriles by microbes is of vast interest both from an environmental 
and health perspective. Therefore the potential of micro-organisms con- 
taining nitrile-hydratases and amidase for phytoremediation is studied 
with increasing intensity (Padmaja and Balagopal 1985; Narayanasamy et 
al. 1990; Shanker et al. 1990; Silva- Avalos et al. 1990; Shah et al. 1991; Barclay 
et al. 1998a, b; Dumestre et al. 1997a, b; Pereira et al. 1997; Cowan et al. 1998; 
Kobayashi and Shimizu 1998; Nauter et al. 1998). Recently it could be shown 
that the stress-resistant fungus genus Scytalidium y which comprises ther- 
mophilic and acidophilic species (Sigler and Carmichael 1974) is able to 
grow on metal-complexed cyanides, typically found in former gasworks 
sites (Barclay et al. 1998a). The genus Scytalidium is closely related to the 
recently isolated, extremely acid and heavy metal-resistant filamentous 
fungus Bispora sp., which has been investigated also in this study. Stress 
physiology of the latter fungus was recently analyzed in our department 
(Gimmler 2000, 2001; Carandang et al. 2001; Gimmler et al. 2000, 2001, 




Table 2. Molecular biological studies of microbial enzymes related to Ci metabolism and folate biosynthesis (compare Fig. 1; see also Cossins and 
Chen 1997). Information refers preferentially to fungi, but few data on bacteria, protozoa and algae are also included. For biochemical data see 
Table 1 
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Table 3. Microbial utilization of low-molecular substrates related to carbon one metabolism as source for nitrogen and/or carbon. Data in paratheses: 
Restricted to certain growth conditions.Most of the information refers to fungi, but some bactera are also included 
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2002). Because of its strong acid and metal resistance, Bispora sp. may be a 
good candidate for phytoremediation of polluted soils, e.g. former gas- 
works sites and industrial wastes. 

Indeed, Bispora sp. is able to grow on metal-complexed cyanides as the 
sole N-source, even though growth rates were considerably lower (about 
30%) than in the presence of NH 4 + (Fig. 2A) or nitrate (not shown). Also 
putative products of NHase activity, namely short-chain amides, can serve 
as growth substrates in Bispora sp.: At both acid and neutral pH the fungus 
exhibited good growth with ammonium and acetamide as N-source 
(Fig. 2B), whereas the growth was much lower with formamide. Bispora sp. 
is unable to utilize urea as the nitrogen sources (Fig. 2C). Growth on 
acetamide was resumed only after a lag period of several days (Fig. 2C). 

The nitrogen content of control samples (NH 4 + ) and those grown on 
acetamide and formamide were relatively similar, whereas that of cultures 
deprived of nitrogen contained much less nitrogen (Fig. 3). The molar 
ratios carbon/nitrogen were close to 10 in control cells and in cultures 
grown with acetamide, but 2.5-fold higher in cultures deprived of nitrogen 
(Fig. 3B). C/N ratios in formamide cultures are significantly higher than 
those of control cells, but also significantly lower than in cells deprived of 
nitrogen. The data thus indicates that Bispora sp. exhibits good growth and 
a reasonable N-status with acetamide as sole N-source and glycerol as 
C-source. With formamide as sole N-source the N-status satisfactory, but 
the growth rate is reduced. This may indicate that ammonia deriving from 
a putative formamidase reaction is well used, but that C-supply is not 
adequate under these conditions. 

Bispora sp. grown at pH 7.0 with NH 4 + as N-source and glycerol as 
C-source neither expresses urease (EC 3.5. 1.5) nor formamidase (EC 
3.5.1.9) or acetamidase (EC 3.5.1.14) (Table 4). This agrees with the obser- 
vation that in general NH 4 + prevents the expression of these enzymes in 
filamentous fungi (Tonon et al. 1990). However, when Bispora is cultured 
in the presence of formamide as N-source and glycerol as C-source a 
formamidase is expressed (Table 4) even though growth is low under such 
conditions (Table 4, compare Fig. IB). Surprisingly, when cultured with 
acetamide as N-source and glycerol as C-source, which permits good 
growth (Fig. 2B), neither acetamidase nor an urease, but a formamidase is 
expressed (Table 4). 

Bispora sp. takes up 14 C-acetamide into control cells and cells pre-cul- 
tured with acetamide, but much less into cells pre-cultured with formamide 
(Fig. 4). This effect is observed both in cells pre-cultured with (Fig. 4A) and 
without glycerol (Fig. 4B) as additional carbon source. Part of the radioac- 
tivity from 14 C-acetamide is incorporated also into the water-insoluble, 
pelletable fraction of the fungus (not shown). TLC analysis reveals that 
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NH 4 + [K 3 CO(CN)J [K 4 Fe(CN)g] 



Control KN0 3 Formamide Acetamide 




Time (d) 

Fig. 2. Nitrogen sources of the filamentous fungus Bispora sp. (adapted from Gimmler et al. 
2002). A Growth on NH 4 + and metal complexed cyanides as the sole nitrogen source (pH 7.0, 
glycerol as carbon source). B Growth with NH 4 (control), nitrate, formamide and 
acetamide as N-source (glycerol as carbon source) at pH 1.0 and 7.0. C Kinetics of growth 
with acetamide or urea as the sole N- and C-source (pH 7.0) 
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Control -N Formamide Acetamide 

(+ NH 4 + ) 



Fig. 3. A N-content and B molar C/N ratio in the filamentous fungus Bispora sp. as affected 
by the pH of the growth medium and various N-sources (carbon source: glycerol). (Adapted 
from Gimmler et al. 2002) 



products labelled from 14 C-acetamide are (glycine+serine), malate, alanine 
and methyl-THF (Fig. 4C). Bispora sp. also incorporates 14 C-formate into 
the insoluble, pelletable fraction independent on the pH of the growth 
medium (Fig. 5). 

In experiments crude enzyme extracts of Bispora, the subsequent TLC 
analysis demonstrated that the label of the substrates 14 C-formate and 
14 C-methyl-THF are incorporated into THF and glycine/serine, respec- 
tively. 

In summary, results of this case study indicate that the acid and heavy 
metal-resistant filamentous Bispora sp. can use acetamide as sole nitrogen 
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Table 4. In vitro amidase and urease activity of Bispora sp. as affected by the N-source during 
the pre-culture and the substrate during the assay 



Experiment 

number 


Pre-culture of 
the fungus in 


Substrate during 
the measurement 


Amidase activity 
measured as change 
in conductivity 
(A pS h _1 pg _1 protein) 


I 


NH 4 + (control) 


Formamide 


0 






Acetamide 


0 






Urea 


0 




Formamide 


Formamide 


0.064±0.42 






Acetamide 


0 




Acetamide 


Formamide 


0.121±0.008 






Acetamide 


0 


II 


Formamide 


Urea 


0 




Acetamide 


Urea 


0 




Formamide 


Formamide 


0.046±0.006 






Acetamide 


0 




Acetamide 


Formamide 


0.033±0.003 






Acetamide 


0 



and carbon source (Gimmler et al. 2002). To a lesser extent the fungus can 
also assimilate formamide and metal-complexed cyanides, whereas it is 
totally unable to assimilate urea. 

It is unknown whether acetamide and formamide are taken up by a 
catalyzed transport across the plasma membrane or by simple diffusion. 
Both is thermodynamically possible. Some authors postulated that 
acetamide and formate are able to enter cells via aquaporines (Meinild et 
al. 1998). The reflection coefficient for acetamide was measured to be close 
to 0.91 and that for formamide close to 0.80. 

After entry into the cells acetamide is immediately converted to other 
metabolites, indicated by a very low cytosolic pool of acetamide. The same 
is assumed to occur with formate. The supposed fate of both amides in 
Bispora (Fig. 6) is in principle agreement with data of the literature (Fig. 1). 
However, somehow unusual is the apparent absence of a classical acetami- 
dase. Acetamide is supposed to be split into formamide and a Ci unit. The 
latter is assumed to be fed into the Ci metabolism by a methyl-tetrahydro- 
folate synthetase, which relates it to the metabolism of glycine, serine, and 
methionine (Pasternack et al. 1994a,b, 1996; Mouillon et al. 1999; Rebeille 
and Douce 1999; Hanson et al. 2000). The internal level of folic acid in 
Bispora sp. is about 0.5 pg per gram of fresh weight (not shown), corre- 
sponding to about 1.1 nmol/g fresh weight. This level is comparable to that 






Incorporation of 14 C-acetamide Incorporation of 14 C-acetamide 

(pmol g* 1 FW) (pmol g 1 FW) 



Fig. 4. Incorporation of 14 C-labelled solutes into intact cells of the filamentous fungus 
Bispora sp. (adapted from Gimmler et al. 2002). A Time-dependent uptake of 14 C-acetamide 
(final concentration 0.1 mM) into the ethanol-soluble fraction of Bispora sp. in the presence 
of different N-sources with glycerol as carbon source. B Time-dependent uptake of 14 C- 
acetamide (final concentration 0.1 mM) into the ethanol- soluble fraction of Bispora sp. in 
the presence of acetamide or formamide as N- and C-sources. C TLC analysis of the 
incorporation of 14 C-acetamide (AA) (4 h) into ethanol-soluble compounds of Bispora sp. 
(pre-culture: a) + glycerol + AA - NH 4 + ; b) + glycerol + AA + NH 4 + ; c) - glycerol + AA + 
NH 4 + . Compound 5 in AA from co-chromatography 
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Incubation time (min) 

Fig. 5. Time-dependent incorporation of 14 C-formate (final concentration 0.01 mM) into 
the insoluble, pelletable fraction of Bispora. sp. at two different pH values of the growth 
medium. (Adapted from Gimmler et al. 2002) 




Fig. 6. Putative scheme of uptake and assimilation of acetamide, formamide and metal 
complexed cyanides in the acid and heavy-metal-resistant filamentous fungus Bispora sp. 
and its relation to N-metabolism and Ci metabolism as treated in this review. Dotted arrows 
indicate diffusional uptake, squares possible sites of catalyzed uptake (adapted from 
Gimmler et al. 2002). Numbers refer to enzymes cited in Table 1 (column 1) 
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of higher plants (Rebelle and Douce 1999) and should enable the fungus to 
carry out an efficient Ci unit metabolism. 

Formamide - independent of whether it is taken up as growth substrate 
or is a product of acetamide assimilation - is split into formate and NH3 by 
classical formamidase, causing a second entry into Ci metabolism by a 
formyl-tetrahydrofolate synthetase, which connects to the synthesis of 
purines, glycine, serine, and methionine (Pasternack et al. 1994a,b, 1996; 
Mouillon et al. 1999; Rebeille and Douce 1999; Hanson et al. 2000). When 
Bispora sp. is grown on metal-complexed cyanides, a formamide hydro- 
lyase is supposed to produce formamide, which is metabolized as described 
above. Formamide hydrolyases are high molecular weight proteins (native 
MW between 300 and 1210 kDa; subunits 41-45 kDa; Wang and Vanetten 
1992; Wang et al. 1992; Brown et al. 1995; Dumestre et al. 1997a, b; Barclay 
et al. 1998a, b; compare Table 1). Enzyme activity is generally found only 
under aerobic conditions (which matches the fact that Bispora sp. is obli- 
gate aerobic) and takes HCN, but not CN“ as substrate. For this reason its 
activity is strongly dependent on the pH. It is reasonable to assume that the 
entry of the cyanides into the cells takes place in the form of the lipophilic, 
protonated form. The assimilation of metal-complexed cyanides by the acid 
and metal-resistant fungus makes Bispora sp. a possible candidate for the 
remediation of industrial polluted soils. 



3.4 DMSP and Microbial Ci Metabolism 

Dimethylsulfonioproprionate (DMSP), a common osmolyticum of higher 
plants (Rhodes et al. 1997) and algae (Summers et al. 1998), does not fit 
exactly into the topic of this review, since it does not contain nitrogen. 
Nonetheless, DMSP accumulation is of large environmental and physi- 
ological interest. DMSP biosynthesis begins in the cytosol of higher plants 
and algae and finishes in chloroplasts (Trossat et al. 1996). Its environ- 
mental importance arises from the observation that DMSP is the major 
source of atmospheric dimethylsulfide (DMS), which is a crucial climate 
gas. Its formation is related to microbial Ci metabolism. Some bacteria, 
which are able to grow on DMSP, metabolize DMSP to DMS and acrylate or 
via initial demethylation to methylthioproprionate (MTPA) (Jansen and 
Hansen 2000, 2001).The former reactions follows Eq. (8): 

DMSP OMSP-THF Methyl transferas^ DMS+aCiylate (8) 

This THF methyl transferase exhibits some specificity for DMSP, since 
glycine betaine, which is structurally an N-containing analogue of DMSP, 
does not function as methyl donor for the DMSP-THF methyl transferase. 
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The same applies to various S-containing derivates of DMSP. The physi- 
ological benefit of Eq. ( 8 ) is clear, since DMSP, preferentially of algal origin, 
supports growth of the bacteria containing the DMSP-THF methyl trans- 
ferase. Obscure, however, is the physiological benefit of the stoichiometric 
demethylation of DMSP to MTPA. The (homo)acetogenic fermentation 
based on this demethylation did not support growth of five Eubacterium 
limosum strains, Sporomosa ovata , Sporomosa spaheroides and Acetobac- 
terium woodii. This is in contrast to the analogous demethylation of glycine 
betaine, which strongly supports the growth of these bacteria. 



3.5 Methanol Connects Ci Metabolism of Higher Plants 
and Micro-organisms 

Methanol has been identified beside methane and DMS as one of the major 
organic compounds in forest air and in the troposphere (Nemecek-Mar- 
shall et al. 1995; Fall and Benson 1996). Vast amounts are produced by the 
vegetation, mainly by C3 plants. Two main sources of methanol are dis- 
cussed: 

1 . Demethylation of methylated pectins to polygalacturonans with metha- 
nol as by-product catalyzed in microsomal membranes by a pectin 
methyl transferase (PMT; EC 2 . 1 . 1 ) (Ishikawa et al. 2000; see Eq. 9, 
compare also bottom of Fig. 1): 

Polygalacturonan methyl ester Pe cm methyl, ransfera se> p 0 i yga i act uronan 

+ 11 CH 3 OH (9) 

This demethylation of pectin serves to assist cell maturation (Fall and 
Benson 1996) by providing carboxyl groups in side chains of the pectin 
polymers in the primary layer of the cell walls (preferentially those of 
dicots and gymnosperms). These groups bind divalent cations such as 
calcium and magnesium and thereby support cross-linking of the poly- 
mer chains to a more rigid gel matrix of the primary layer. 

2. Another potential source of biospheric methanol is lignin from the 
secondary layer of cell walls. Lignin is rich in methyl ether groups. Wood 
rotting fungi and a variety of bacteria are capable of enzymatic cleavage 
of these ether bonds leading to volatilization of methanol. 

Independent on its origin, the liberated methanol relates to Ci metabolism 
in completely different pathways: It can to a certain extent stimulate plant 
growth (Fall and Benson 1996). Leaf methanol is rapidly oxidized through 
the intermediate stages of formaldehyde and formate to CO 2 (Fig. 1 ). For- 
mate may feed into the Ci folate pool of the plant and thereby support 
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growth, e.g. by the synthesis of serine, glycine, methionine, purines or 
thymilidates. CO 2 may be utilized by the photosynthetic machinery of the 
leaves. Similar as fungi and bacteria higher plants contain formate dehy- 
drogenase (FDH) required for this process, preferentially in mitochondria 
and non-photosynthetic tissues (Cowan et al. 1998; Hourton-Cabassa et al. 
1998; Suzuki et al. 1998; Igambierdiev et al. 1999; Wingler et al. 1999). Its 
activity is increased by environmental stress such as hypoxia of roots and 
Fe-deficiency (Hourton-Cabassa et al. 1998; Suzuki et al. 1998). It is not 
clear as yet, whether physicochemical and molecular properties of the FDH 
in higher plants are identical with those of the various FDHs in micro-or- 
ganisms. The key enzyme for channelling formate into Ci metabolism of 
higher plants is thought to be similar to that in micro-organisms, i.e. the 
serine hydroxymethyl transferase (SHMT). Additionally it is discussed that 
methanol may induce changes that cause an increase in the availability of 
photorespiratory carbon (Fall and Benson 1996). 

Alternatively leaf methanol, emitted primarily through the stomata, or 
from other sources may serve as carbon source for methylotrophic bacteria 
or fungi, which leads again to Ci metabolism. In more than 70 plant species 
the surface of that/those leaf side(s) which possess stomata is occupied by 
an extensive society of epiphyllic methylotrophic bacteria, that are bacteria 
which can utilize methanol as sole carbon source for growth. 

The first step of the fungal methanol utilization is methanol oxidation 
by the methanol oxidase reaction (MOX, EC. 1.1. 1.1 3, Eq. 10) in fungal 
peroxisomes (Nishida and Erikson 1987; Erikson and Nishida 1988) 

CH 3 OH+O 2 - Meth ^ olo P da l e > CH 2 O+H 2 O 2 (10) 

Formaldehyde is converted to formic acid by a glutathione-dependent 
formaldehyde dehydrogenase (EC 1.2.1. 1, Eq. 11; compare also Eq. 7): 

CH 3 0+glutathione ox - CHOOH+glutathione re d ( 1 1 ) 

The latter reaction is followed eventually by the formate dehydrogenase 
reaction, which converts formate into CO 2 (Eq. 7). It is under discussion, to 
which extent Eqs. (11) and (7) reflect a detoxification mechanism for 
formaldehyde or a pathway for carbon and/or energy utilization under 
conditions of methylotrophic growth of fungi or when co-cultured in the 
presence of other organic growth substrates (compare references cited in 
Table 4). 
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3.6 Halogenated Organic Compounds and Microbial Ci Metabolism 

Halogenated organic compounds (HOCs) are produced in large quantities 
world-wide both industrially (e.g. organochlorine pesticides) and naturally 
(Fetzner 1998). HOCs present an important class of environmental pollut- 
ants, since they are thought to be involved in stratospheric ozone depletion. 
A large proportion of total HOCs is chloromethane (CH3CI). Naturally it 
originates mainly from tropical and subtropical forests (Watling and Har- 
per 1998). About 16,000 t of chloromethane are produced in these forests 
yearly by wood-rotting fungi (Hymnochaetaceae), among them genera 
such as Phellinus or Inanotus (Fetzner 1998; Saxena et al. 1998; Watling and 
Harper 1998). The fungal biosynsythesis of CH3CI is catalyzed by a methyl 
chloride transferase (Saxena et al. 1998). It is suggested that adenoysl-ho- 
mocysteine (SAM) is the methyl donor (compare bottom of Fig. 1), which 
presents a link to Cl metabolism. 

Some bacteria have evolved several strategies for enzyme-catalyzed de- 
halogenation and degradation of both haloaliphatic and haloaromatic com- 
pounds (Fetzner 1998). This is of two-fold environmental interest. On one 
side such systems present detoxification mechanisms for the environment. 
From the bacterial view it presents the utilization of unique organic nutri- 
ents: Some bacteria, e.g., are able to grow with CH3CI as the sole C and 
energy source (Vannelli et al. 1998; Coulter et al. 1999; McAnulla et al. 2001; 
Kayser et al. 2002; Studer et al. 2002). The substrate-inducible chlo- 
romethane dehalogenase (CMD) reaction involves a multi-step pathway 
from CH3CI via formaldehyde to formate (Fetzner 1998; Vannelli et al. 1998; 
Coulter et al. 1999; Studer et al. 1999; Studer et al. 2001) and thus clearly 
presents a link to core Ci metabolism of bacteria. CMD has a molecular 
weight of about 67 kDa and contains cobalamin (vitamin B12 cofactor). 
Recently, the gene of this enzyme has been identified (Studer et al. 1999, 
2001, 2002; McAnulla et al. 2001; Kayser et al. 2002). 

In summary, CH3CI produced naturally by wood-rotting fungi in reac- 
tions related to Ci metabolism is dehalogenated by certain bacteria again 
with the aid of reactions linked with Ci metabolism. 
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Nuclear Magnetic Resonance Applications 
to Low-Molecular Metabolites in Plant Sciences 

Bernd Schneider 



1 Introduction and History 

The basic feature of nuclear magnetic resonance (NMR) spectroscopy is the 
observation of magnetic properties of atomic nuclei and their changes 
under the influence of chemical bonds or adjacent atoms. Although re- 
stricted to atomic nuclei that possess a nuclear magnetic moment, NMR is 
universally applicable to analyze the occurrence of such nuclei in the steady 
state and in dynamic interactions with their chemical environment. Due to 
this general feasibility, after the discovery by Felix Bloch and Edward Purcel 
in 1946, who were awarded the 1952 Nobel Prize in physics, NMR was 
originally established in nuclear physics to accurately determine nuclear 
magnetic moments. After it had been demonstrated that the NMR fre- 
quency depends on the chemical environment (Knight 1949), this tech- 
nique became an interesting tool in chemistry, e.g. for confirming struc- 
tures of synthetic organic compounds. It rapidly expanded into different 
directions and additionally has been applied in various disciplines such as 
material science, medicine, and biology. Improvements in spectrometer 
technology (superconducting magnets, wide-bore magnets, ultra-high- 
field magnets), probe head design, Fourier-transform NMR, computer 
technology and progress in pulse sequences, especially 2D correlation, 
multiple resonance spectroscopy, and pulsed field gradients, further ex- 
tended the possibilities to apply NMR techniques (Fig. 1). A second Nobel 
Prize was awarded to one of the pioneers of NMR spectroscopy, Richard 
Ernst, in 1991 for his contribution to NMR methodology. Biological appli- 
cations allowed elucidation not only of metabolic processes but also of 
structures of biological macromolecules and their dynamic interaction with 
other molecules. The 2002 Nobel Prize in chemistry was awarded in part to 
Kurt Wuthrich for his contribution to determining the three-dimensional 
structures of biological macromolecules in solution by NMR. The present 
review covers various NMR applications, including high-resolution (HR- 
NMR) and magnetic resonance imaging (MRI), in natural product chemis- 
try, biochemistry, and physiology with special emphasis on plants. 
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Fig. 1 . Selected techniques and applications of NMR and MRI in plant sciences 
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2 Identification and Structure Elucidation 

Structure elucidation of natural products has been an important task in 
drug discovery, the search for agricultural chemicals, and identification of 
compounds involved in ecological interspecies interactions. In addition, 
knowledge on the distribution of natural products in the plant kingdom is 
of considerable interest for chemotaxonomy. The majority of the more than 
150,000 secondary metabolites (Steglich et al. 1997), among them a large 
number from plant sources, would not have been structurally elucidated 
without NMR spectroscopy. Natural product chemistry has been a domain 
of ’H-, 13 C-, and 15 N-spectroscopy, proton homonuclear shift-correlation 
NMR spectroscopy ^H^H-COSY, TOCSY) including detection of spatial 
proximities by NOESY and ROESY techniques. In addition, proton-de- 
tected heteronuclear shift-correlation experiments such as HMQC, HSQC, 
HMQC and a multitude of further variants have been developed. For details 
in structure elucidation, the reader is referred to detailed reviews on natural 
product NMR in general (Derome 1989; Sadler 1988; Duddeck et al. 1998; 
Croasmun and Carlson 1994) including specific classes of natural products 
such as alkaloids (Martin and Crouch 1994) and terpenoids (Bohlmann and 
Zeisberg 1975; Ferreira et al. 1998). Coupling of chromatographic separa- 
tion techniques with NMR spectroscopy was first described by Watanabe 
and Niki in 1978 and has been established in recent years as an important 
supplementary methodology in natural product chemistry (Albert et al. 
1999; Schneider 2000; Wolfender et al. 2001). 

Online LC-NMR coupling represents one of the most powerful and 
time-saving methods for the identification and structure elucidation of 
natural products in mixtures. The continuous-flow mode is the most effi- 
cient technique but is restricted to major components of a mixture. It results 
in a pseudo-2D spectrum with a spectroscopic dimension ('H chemical 
shift) and a time dimension (retention time). Stop-flow, loop-sampling, and 
solid phase extraction (SPE)-sampling (Nyberg et al. 2001) are more sensi- 
tive than the continuous-flow mode and, therefore, useful also to record 
*H-NMR spectra of less abundant compounds. Capillary LC-NMR coupled 
to a custom-built microcoil NMR probe (1.1 pi observed volume) has been 
used recently to identify 37 ng of a-pinene in a terpenoid mixture at 
500 MHz (Lacey et al. 2001). In addition to ’H-NMR spectra, various 2D 
spectra can be measured by stop-flow coupling techniques because the 
fraction can remain within the flow cell as long as necessary. UV or DAD 
detection, which usually are used for monitoring the separation of the 
mixture under study, provides absorption data, and coupling with mass 
spectrometry and additional analytical methods is possible as well. LC- 
NMR is especially suitable to solve the increasing dereplication problem in 
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natural product isolation. Thus, several studies employed LC-NMR to 
screen extracts in order to identify new compounds, which subsequently 
have been isolated and subjected to off-line analytical techniques including 
conventional NMR. 

The strategy of using HPLC-NMR coupling in addition to conventional 
NMR spectroscopy is exemplified by the structure elucidation of a phenyl- 
propanoid-derived natural product, anigorootin (Fig. 2). Rhizomes of 
Anigozanthos flavidus (Haemodoraceae) were screened for new natural 
products by means of HPLC-NMR coupling (Holscher and Schneider 
1999). In addition to several known compounds, an unknown natural 
product was detected, the HPLC^H-NMR spectrum (Fig. 2A) of which 
showed signals indicating a compound of the phenylphenalenone type. 
Acetylation of the isolated substance yielded a product showing an acetyl 
signal at 8 2.2 in the *H-NMR spectrum due to acetylation of the equivalent 
aliphatic hydroxyls in the parent compound. In addition, short-term ace- 
tylation gave a second product, which showed a double set of l H signals 
(Fig. 2B) with only one acetyl resonance. From these data a homodimeric 
structure of the new compound was deduced. Heteronuclear correlation 
spectra (HMQC and HMBC) were used to elucidate the core structure and 
to assign all proton and carbon signals of anigorootin. The coupling con- 
stant between the methine protons H-7b and H-14b, which is essential to 
determine the stereochemistry of anigorootin, was obtained from satellite 
doublets in the parent compound and also from doublets of the nonequiva- 
lent methine protons of the monoacetyl derivative (Fig. 2B). The relative 
configuration of the four stereocenters were further established by NOESY 
experiments of the two acetyl derivatives and later confirmed by x-ray 
crystallography (Otalvaro et al. 2002). 

The inherent low sensitivity of NMR spectroscopy is an obstacle, which 
has limited its application in natural product chemistry and plant sciences 
to a certain extent. There is an increasing demand to detect molecules and 
ions in biological samples and to elucidate structures of low-molecular 
compounds and biological macromolecules more sensitively and to enable 
measurement of reduced sample amounts, respectively. Consequently, 
there are several strategies to improve sensitivity of NMR and to overcome 
limitations. Inverse-detection methods in heteronuclear correlation tech- 
niques, the use of pulsed field gradients, and sophisticated pulse sequences 
have significantly contributed to improve performance of NMR spectros- 
copy. Hardware developments such as enhancement of magnetic field 
strength by construction superconducting magnets, ultra-high-field mag- 
nets making use of Joule-Thompson cooling units, miniaturization 
(Crouch and Martin 1992; Olson et al. 1995; Schlotterbeck et al. 2002; 
Wolters et al. 2002), and improvements in probe head design are further 
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Fig. 2. Strategy of structure elucidation of anigorootin from Anigozanthos preissii (Haemo- 
doraceae) using A HPLC^H-NMR coupling and B conventional ^ NMR and 2D tech- 
niques. The arrows in the structure of anigorootin (Ri=R 2 =H) and its acetyl derivatives 
(monoacetyl: Ri=Ac, R 2 =H; diacetyl: Ri=R 2 =Ac) indicate NOESY correlations between 
H-7b/H-14b and the acetyl protons, respectively. For further details, see text 
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important factors in this context. One of the most exciting developments 
in recent years has been the introduction of cryogenic probes (Styles et al. 
1984, 1989). Since thermal noise is proportional to temperature and resis- 
tance of materials used, reduction of one or both of these parameters is 
useful to improve signal-to-noise ratio. Thus, probe heads have been devel- 
oped in which the radio frequency coils and the preamplifiers are cooled to 
very low temperature around or below 20 K while the sample remains at 
room- or even elevated temperature. This leads to reduction of noise by 
factor 3 to 4, corresponding up to 16-fold shorter data acquisition time, to 
obtain comparable results for the same sample in a cryogenic probe com- 
pared to a conventional probe. This high sensitivity of cryogenic NMR 
technology is of special interest for the identification of small samples of 
natural products (Logan et al. 1999; Russell et al. 2000) but is extending to 
other fields of plant sciences as well. The recent availability of cryogenic 
flow probes provides the opportunity to combine the sensitivity of the 
cryoprobe technology with the advantages of LC-NMR hyphenation. 



3 Metabolic Profiling 

NMR-based metabolic profiling provides information which is comple- 
mentary to genomics and proteomics, and in combination with those, can 
be used to develop models describing cellular functions. Standardized 
extraction procedures and NMR analyses of extracts generate datasets, 
which are evaluated by computational methods. 

It has been demonstrated, for example, that a 1D-NMR approach is 
capable of accurately discriminating the mode of action of different herbi- 
cides. Using only a small amount of tissue, this method was able to detect 
minute differences in a plant’s metabolic profile caused by treatment with 
various herbicides. Signals of the herbicide were not visible in the *H-NMR 
spectra since it was applied in low concentration and only the spectral 
manifestation of the variations in the pool of the plant metabolites was 
observed (Aranibar et al. 2001). 

However, in many cases, the occurrence of one or more individual 
metabolites in an extract may be of interest. To make sure that the metabo- 
lite of question is present, a fingerprint pattern of characteristic resonance 
lines in the expected proportions in the ID spectrum may be sufficient. A 
set of characteristic lines often is visible even if the spectrum partially is 
crowded with overlapping signals. Homo- and heteronuclear 2D-NMR 
techniques, preferable in the ^-detected mode, are powerful tools to 
enhance resolution, to detect co-resonating components, and thereby iden- 
tify desired components in a mixture. This approach has been successfully 
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applied, for example, to an algal extract (Fan 1996). Using 2D-TOCSY 
spectra, a variety of amino acids and other metabolites have been identified 
unambiguously in Chlorella without separation of the sample. In addition 
to plant NMR, mass spectrometric techniques provide supplementary in- 
formation in metabolic profiling. A number of components have been 
identified in crude root exudates of different gramineous plants by 2D 
NMR, GC-MS and HR-MS (Fan et al. 2001). Using this approach, the role 
of exudate metal ion ligands in the acquisition of Cd and transition metals 
has been examined. 

Another recent example of metabolomic analysis has shown the conse- 
quences of Cd exposure to a heavy metal-resistant plant, Silene cucubalus 
(Bailey et al. 2003). These examples demonstrate that metabolic profiling is 
capable of detecting the metabolic consequences of altered environmental 
conditions. Further applications are expected in detection of changes of 
metabolite pattern in transgenic plants and mutants in comparison with 
the wild type and, in this context, metabolic profiling will frequently be used 
in future. 



4 NMR in Biosynthesis and Metabolic Flux Analysis 

The question of biosynthetic pathways of natural products has been an 
important area in plant research since radiolabeled precursors became 
available in the 1950s. The possibility to determine the labeling position in 
the target molecules by means of NMR was one of the most important 
reasons which resulted largely in substitution of radiolabel by stable iso- 
topes such as 2 H, 13 C, and 15 N in tracer experiments. The use of labeled 
precursors in biosynthetic studies and aspects of labeling techniques have 
been reviewed several times (Vederas 1985; Simpson 1998; Schneider et al. 
2003). Information on the number of isotopically enhanced atoms and their 
specific position in a molecule is mainly gained from enlarged resonance 
signals, coupling patterns and isotopic-induced changes in the chemical 
shifts. Data obtained from tracer experiments are useful to gain insight into 
precursor-product relationships. Modern developments in NMR spectros- 
copy such as LC-NMR hyphenation (Schmitt and Schneider 2001) and 
cryoprobe technology (Bringmann and Feineis 2001) have already been 
used in sensitive detection of stable isotope labeling in biosynthetic studies. 

A recent 13 C-NMR study is shown as an example (Fig. 3). Incorporation 
of [l- 13 C]phenylalanine into both phenylphenalenones, e.g. anigorufone 
(1) and phenylbenzoisochromenones (phenylnaphthalides) (e.g. com- 
pound 2) in root cultures of Wachendorfia thyrsiflora , a South African 
member of the Haemodoraceae (Opitz and Schneider 2003) confirmed that 
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Fig. 3. Related biosynthesis of oxabenzochrysenones and phenylphenalenones in Wachen- 
dorfia thyrsiflora (Haemodoraceae). The HMQC spectrum of the oxabenzochrysenone 2 
shows enhanced intensity of the cross-signal of H9/C-9 indicating 13 C enrichment of C-9. 
The HMBC spectrum of 2 exhibits enhanced cross-signals of H-9/C-1 and H-3/C-1 (which 
are due to three bond C-H coupling), indicating 13 C enrichment of C-l. The one bond C-H 
correlation of the enriched signal of C-9 and the correlation through two bonds between 
H-8 and C-9 are also visible in the HMBC spectrum and provide additional evidence of 13 C 
enrichment of C-9. Some weak natural abundance cross-signals are visible as well. The 
C-H-heterocorrelation spectra of anigorufone (1) (not shown) exhibited enrichment of 
analogous positions C-6 and C-7 



both types of compounds are formed via a common pathway diverging only 
in late steps. Conclusively, oxidative formation of the oxalactone structure 
of the phenylbenzoisochromenones from the aromatic ring system is very 
likely. A dioxygenase-type mechanism, followed by decarboxylation, has 
been proposed for the key steps of this conversion. 
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In experiments using singly or doubly isotope-labeled precursors only a 
very limited number of isotopomers is observed. The complexity of iso- 
topomer patterns increases with the number of enriched positions and the 
complexity of the pathway, e.g. whether or not bonds between stable iso- 
tope-labeled positions are formed or broken in the course of a biosynthetic 
process. More complex isotopomer mixtures are found after application of 
general metabolites, which are diverted to all branches of intermediary 
metabolism. An isotopomer pattern recognition approach has been devel- 
oped to deduce the origin of secondary products from intermediary meta- 
bolism (Bacher et al. 1999). This retrobiosynthetic method is based on 
comparison between the labeling patterns of secondary metabolites and 
primary metabolites such as amino acids and ribonucleotides. It can be 
used to reconstruct the isotopomer pattern of central metabolic intermedi- 
ates. The biosynthetic retroanalysis has been widely explored in a basic 
biosynthetic issue, namely to discriminate the origin of isoprenoids from 
the classical mevalonate (MVA) pathway and the more recently discovered 
2C-methyl-D-erythritol 4-phosphate (MEP) alias l-deoxy-D-xylulose-5- 
phosphate (DOXP) pathway (Rohmer et al. 1993). 

In a recent example, the biosynthesis of the bitter compound amaro- 
gentin, produced by root cultures of Swertia chirata has been investigated 
(Wang et al. 2001). The results clearly showed that this compound origi- 
nates from a mixed biosynthesis. The sweroside moiety was formed pre- 
dominately via the DOXP route and the biphenylcarboxylic acid moiety 
surprisingly was not biosynthesized via benzoic acid or phenylpropanoid 
precursors but from an early shikimate precursor via a polyketide-type 
biosynthesis. As the DOXP pathway is operative in the plastids, conclusions 
about the compartmentation of the biosynthesis can be drawn indirectly 
from retrobiosynthetic NMR studies. However, the compartmental separa- 
tion is not complete and crosstalk between the mevalonate pathway and the 
DOXP pathway occurs (Lichtenthaler 1999) and may play a role in stress 
situations, and in cell cultures where the architecture of the plant tissue is 
different from that of intact plants. 

In this context, quantitative flux analysis is indispensable to determine 
the contribution of different pathways to a distinct metabolic pool. Since 
the intensity of 13 C-NMR signals largely depends on the relaxation proper- 
ties, spectra of the isotope labeled compound under study and of natural 
abundance reference compounds have to be recorded under identical ex- 
perimental conditions. NOE-based differences in signal intensities are usu- 
ally eliminated by inverse-gated decoupling and, in addition, a relaxation 
reagent such as Cr(acac )3 can be used to determine the absolute enrichment 
of 13 C in specific positions of the molecule. 
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The incorporation of [ 13 C] 1-deoxy-D-xylulose into a variety of terpenes 
of the liverwort Conocephalum conicum has been studied by quantitative 
13 C-NMR spectroscopy. The labeling patterns of the sesquiterpene cube- 
banol, for example, indicated a possible degradation of 1-deoxy-D-xylulose 
to acetate and subsequent incorporation via the mevalonic acid pathway 
(Thiel and Adam 2002). Moreover, in bornyl acetate, the labeling occurred 
only in the acetate moiety whereas the isoprene units remained unlabelled. 
Different levels of enrichment of the labeled positions of cubebanol and the 
acetate residue of bornyl acetate were found, which imply that different 
acetyl CoA pools are utilized for the acetylation of borneol and the IPP 
biosynthesis of cubebanol via the mevalonic acid pathway. From these data, 
it was assumed that the two reactions occur most probably in different 
cellular compartments or cell types of the plant. 

Many investigations have shown that biosynthetic pathways frequently 
are not simple linear precursor-product relationships but in reality repre- 
sent rather complex networks, which enable formation of products via 
different subroutes. Moreover, depending on various endogenous and ex- 
ogenous factors, plants are able to use alternative pathways in a flexible way. 
Labeling patterns can be used to reveal quantitative aspects of carbon flux 
in the metabolic network of the organism under investigation. Conse- 
quently, in the postgenomic era, the quantitative analysis of complex me- 
tabolic flux patterns is becoming more and more important (Szyperski 
1998; Roberts 2000; Roscher et al. 2000; Weichert 2001). This approach 
helps to understand how the genome determines plant growth, develop- 
mental and defense processes through the regulation of the metabolic 
network. The availability of molecular tools and the detailed knowledge of 
the genetic basis of metabolic processes offer possibilities for the manipu- 
lation and engineering of metabolic pathways such as the transfer of plant- 
specific biosynthetic genes to other plants or microbial producers and even 
the design of artificial natural products (Dixon 2001; Sato et al. 2001; Lee 
and Schmidt-Dannert 2002). NMR spectroscopy is an important tool in 
monitoring consequences and many other aspects of metabolic engineering 
of plants (Shachar-Hill 2002). 

In biotechnology, the cellular metabolism of a particular organism can 
be rationally redesigned to produce desired products in expense of others, 
which are unwanted. The consequences of altering the metabolic flux 
through the benzophenanthridine alkaloid pathway, for example, were 
evaluated by two-dimensional ^-COSY-NMR spectroscopy (Park et al. 
2003). In this study, changes in metabolic flux through benzylisoquinoline 
alkaloid biosynthesis were shown to affect the regulation of amino acid 
pools. 
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Isotopomer analysis, which represents an essential prerequisite for me- 
tabolic flux analysis, is often a challenge. A ^-NMR method has been 
developed which allows the determination of the complete isotopomer 
distribution in metabolites having a backbone consisting of up to at least 
four carbons (deGraaf et al. 2000). In a first application, the complete 
isotopomer distribution of aspartate isolated from an [l- 13 C]ethanol- 
grown microorganism, Ashbya gossypii , was determined. The approach 
certainly will be useful in plant studies as well. 

In ecology research it is particularly interesting how plants, attacked by 
predators and herbivores, defend themselves by the production of deterrent 
chemicals or enhancement of mechanical resistance. These costly defense 
strategies often require dramatic reorientation of metabolic fluxes. To 
understand the ecological function of low-molecular defense compounds 
and the biosynthetic networks by which they are formed, requires compara- 
tive quantitative flux analysis of the processes found in wild-type plants 
with those found in mutants, or which have been designed by recombinant 
DNA technology. 

The web of metabolic processes involved in the balance between plant 
growth and reproduction, and defense strategies, on the other hand, is 
characterized by competition of divergent pathways for the same key 
intermediates or alternatively, biosynthetic pathways can contribute to one 
product. Quantitative analysis of isotope enrichment of metabolites by 
NMR spectroscopy enables determination of fluxes through a distinct 
pathway, i.e. the extent by which a certain metabolite is synthesized through 
alternative routes can be deduced quantitatively. This approach implies 
both spectroscopic skills and knowledge of mathematical data evaluation 
methods. The application of different strategies to determine metabolic 
fluxes quantitatively has been discussed in detail using an example from 
intermediary plant metabolism, the pathways interconverting hexose phos- 
phates and triose phosphates (Roscher et al. 2000). 



5 In Vivo NMR Spectroscopy 

As a non-invasive technique, in-vivo plant NMR spectroscopy provides 
information on molecules or ions in plant tissue under physiological, or 
near physiological, conditions. In-vivo NMR spectroscopy has been applied 
to plant systems for many different purposes (Ratcliffe et al. 2001). Numer- 
ous publications aimed at the detection of 1 H, 13 C, 15 N, and 31 P in living 
plant tissue and several other nuclei have been employed in addition 
(Martin 1985; Ratcliffe 1994) in in-vivo-NMR experiments. Steady- state 
measurements of NMR-detectable inorganic ions (e.g. ammonium, nitrate, 
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sodium, phosphate) and low-molecular plant metabolites (e.g. carbohy- 
drates, amino acids, organic acids, triglycerides) can be performed to 
determine intracellular pool sizes, compartmentation, or pH (Lundberg et 
al. 1990; Ratcliffe 1996). 

For example, the subcellular compartmentation of homoserine in Acer 
pseudoplatanus cells (Aubert et al. 1998) and proline in salt-stressed leaf 
pieces of Pringlea antiscorbutica (Aubert et al. 1999) has been studied by in 
vivo 13 C and 31 P NMR. The technique developed by these authors involved 
NMR analysis at pH 6.0 followed by alkalinization of the perfusion medium 
using NH 4 + . Under these conditions certain carbon resonances of cytosolic 
amino acids were shifted selectively to lower field and the signals were 
resolved in the spectra to two distinct peaks corresponding to the vacuolar 
and cytoplasmic pools of homoserine and proline, respectively. In Pringlea 
antiscorbutica, proline mainly accumulated in the cytoplasm with concen- 
trations 2-3 times higher in comparison to that in the vacuole, underlining 
its role as cytoplasmic osmoregulator in this plant. 

The possibility to gain information about dynamic processes is one of 
the most important advantages of in-vivo NMR spectroscopy. This possi- 
bility is due to the non-destructive character of this technique, i.e. it is 
possible to analyze the same sample many times under controlled constant 
or changing external conditions. This aspect is of special value to elaborate 
dynamic physiological processes in plants. Consequently, kinetic in-vivo 
NMR studies have been undertaken in energy metabolism, glycolysis, 
amino acid metabolism, Krebs cycle, and other areas of intermediary me- 
tabolism (Ratcliffe 1996). 

Both in-vivo and in-vitro NMR techniques have been employed to study 
various functional aspects of ectomycorrhiza and arbuscular mycorrhiza, 
such as phosphorus nutrition, nitrogen and carbon metabolism, and xeno- 
biotics and secondary metabolism (Pfeffer et al. 2001). Another area of 
application of in-vivo NMR is the adaptation of plant metabolism to specific 
environmental conditions. The pathways by which glycine and serine might 
be utilized as nitrogen sources for the aquatic resurrection plant Chamaegi- 
gas intrepidus were investigated by in-vivo NMR of roots incubated with 
[ 15 N]glycine, [ 15 N]serine and [2- 13 C]glycine. The results were consistent 
with the involvement of the glycine decarboxylase complex and serine 
hydroxymethyltransferase in the utilization of glycine (Hartung and Rat- 
cliffe 2002). 

In-vivo NMR studies on the biosynthesis of secondary metabolites have 
been conducted rather rarely because in general their concentrations are 
lower in comparison with intermediary metabolites. Thus, suitable plant 
systems are required which do contain detectable amounts of the metabo- 
lites of interest or are able to form them from exogenously supplied precur- 
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sors. In addition to abundance of the metabolite, obstacles such as suscep- 
tibility differences in the tissue due to air spaces and paramagnetic ions in 
the medium have to be taken into account. Examples of in-vivo NMR 
studies on secondary metabolites include [ 15 N]tropinone metabolism in 
transformed root cultures of Datura stramonium (Ford et al. 1996) and the 
conversion of monoterpenoid indole alkaloids in cell suspensions of Rau- 
volfia serpentina (Hinse et al. 2001). Remarkably, the latter investigation 
was carried out with unlabeled precursors. Vellosimine (final concentra- 
tion approximately 1 mM) was fed to suspended cells and, in the first step, 
rapid reduction of the aldehyde group to give vellosimol was observed with 
a high-field spectrometer (800 MHz). In another step, hydroxylation in the 
aromatic ring of the indole moiety of vellosimol resulted in sarpagine. 
Assignment of the indole alkaloid metabolites was performed on the basis 
of cross signals in the aldehyde and aromatic region of gradient-assisted 
HSQC spectra, indicating the advantages of using 2D techniques in in-vivo 
NMR studies. In addition to assignment information and improved spectral 
resolution by the second dimension, heteronuclear 2D spectra acquired in 
the inverse-mode provide enhanced sensitivity of 'H for the detection of 
heteronuclei such as 13 C or 15 N in comparison with direct detection tech- 
niques. 

Figure 4 shows a gradient-assisted HMQC spectrum of root cultures of 
Anigozanthos preissii supplied with [2- 13 C]cinnamic acid. Although the 
nutrient medium was substituted with sugar- free medium for the measure- 
ment, strong natural abundance signals of the carbohydrates are visible in 
the upper right part of the spectrum. The cross resonances in the lower left 
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Fig. 4. In vivo NM^ spectrum of root cultures of Anigozanthos preissii (Haemodoraceae) 
after feeding of [2- C] cinnamic acid 
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